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(57) Abstract 



This invention provides isolated polynucleotides encoding DNA polymerases of three members of a subfamily of gamma herpes 
viruses. Two were obtained from macaque monkeys affected with retroperitoneal fibromatosis, the other from human AIDS patients 
affected with Kaposi's sarcoma. A 454-base pair fragment encoding a region near the active site of the DNA polymerase is 69-83 % 
identical amongst the three viruses, but only 54-68 % identical with other known gamma herpes sequences and < 55 % identical with 
alpha and beta herpes sequences. Also provided are ploynucleotides encoding DNA polymerase from related viruses in the RFHV/KSHV 
subfamily. Polynucleotides prepared according to the sequence data can be used as reagents to detect and characterize related sequences. 
Such reagents may be used to detect members of the RFHV/KSHV subfamily, including but not limited to RFHV, RFHV2, and KSHV. 
Corresponding polypeptides and peptide fragments may be obtained by expressing the polynucleotide or by chemical synthesis. They may 
be used for detecting specific antibody potentially present in the serum of infected subjects. They may also be used for designing or 
screening pharmaceutical compounds that limit viral replication by inhibiting DNA polymerase activity. 
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DNA Polymerase of Gamma Herpes Viruses Associated with Kaposi's 

Sarcoma and Retroperitoneal Fibromatosis 

Cross-Reference to Related Applications 

This application is a continuation-in-part of U.S. provisional patent application Serial No. 60/001,148, 
filed July 14, 1995, and pending U.S. patent application Serial No. [pending, attorney docket 29938- 
20001 .00], filed July 11,1 996, which are hereby incorporated herein by reference in their entirety. 

Statement of Rights to Inventions made under 
Federally Sponsored Research 

This invention was made in part during work supported by a grant from the National Institutes of 
Health (RR00166-34). The Government has certain rights in the invention. 

Technical Field 

The present invention relates generally to the field of virology, particularly viruses of the herpes 
family. More specifically, it relates to the identification and characterization of DNA polymerase in a virus 
subfamily, members of which are associated with fibroproliferative and neoplastic conditions in primates, 
including humans. 

Background art 

Kaposi's Sarcoma is a disfiguring and potentially fatal form of hemorrhagic sarcoma. It is 
characterized by multiple vascular tumors that appear on the skin as darkly colored plaques or nodules. At 
the histological level, it is characterized by proliferation of relatively uniform spindle-shaped cells, forming 
fascicles and vascular slits. There is often evidence of plasma cells, T cells and monocytes in the 
inflammatory infiltrate. Death may ultimately ensue due to bleeding from gastrointestinal lesions or from an 
associated lymphoma. (See generally Martin et al., Finesmith et al.) 

Once a relatively obscure disease, it has leapt to public attention due to its association with AIDS. As 
many as 20% of certain AIDS-affected populations acquire Kaposi's during the course of the disease. 
Kaposi's Sarcoma occurs in other conditions associated with immunodeficiency, including kidney dialysis 
and therapeutic immunosuppression. However, the epidemiology of the disease has suggested that 
immunodeficiency is not the only causative factor. In particular, the high degree of association of Kaposi's 
with certain sexual practices suggests the involvement of an etiologic agent which is not the human 
immunodeficiency virus (Berel et al.). 

A herpes-virus-like DNA sequence has been identified in tissue samples from Kaposi's lesions 
obtained from AIDS patients (Chang et al., confirmed by Ambroziuk et al.). The sequence was obtained by 
representational difference analysis (Lisitsyn et al.), in which DNA from affected and unaffected tissue 
were amplified using unrelated priming oligonucleotides, and then hybridized together to highlight 
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differences between the cells. The sequence was partly identical to known sequences of the Epstein Ban- 
Virus and herpesvirus saimiri. It coded for capsid and tegument proteins, two structural components. In a 
survey of tissues from various sources, the sequence was found in 95% of Kaposi's sarcoma lesions, 
regardless of the patients' HIV status (Moore et al.). 21% of uninvolved tissue from the same patients was 
positive, while 5% of samples from a control population was positive. There was approximately 0.5% 
sequence variation between samples. The sequence was also detected at a higher copy number in body 
cavity lymphoma, a lymphomatous effusion with a B-cell genotype occurring uniquely in AIDS patients 
(Cesarman et al.). Other AIDS-associated lymphomas were negative. 

The herpes virus family comprises a number of multi-enveloped viruses about 100 nm in size, and 
capable of infecting vertebrates. (For general reviews, see, e.g., Emery et al., Fields et al.). The 
double-stranded DNA genome is unusually large - from about 88 to about 229 kilobases in length. It may 
produce over 50 different transcripts at various stages in the life cycle of the virus. In one of the stages, a 
number of nucleotide and polynucleotide processing enzymes are produced that are required for viral 
replication, including DNA polymerase, DNAse, dUTPase, ribonucleotide reductase, uracil-DNA 
glycosylase, and thymidine kinase. These functional proteins tend to be relatively well conserved between 
species, compared with external viral components (Karlin et al.). 

The herpes virus family has been divided into several subfamilies. Assignments to each of the 
categories were originally based on the basis of biologic properties, and are being refined as genomic 
sequence data emerges. The alpha subfamily comprises viruses that have a broad host range, a short 
replicative cycle, and an affinity for the sensory ganglia. They include the human simplex virus and the 
Varicella-zoster virus. The beta subfamily comprises viruses that have a restricted host range, and include 
Cytomegalovirus and human Herpes Virus 6. The gamma subfamily comprises viruses that are generally 
lymphotrophic. The DNA is marked by a segment of about 110 kilobases with a low GC content, flanked 
by multiple tandem repeats of high GC content. The subfamily includes Epstein Barr Virus (EBV), herpes 
virus saimiri, equine Herpes Virus 2 and 5, and bovine Herpes Virus 4. 

Herpes viruses are associated with conditions that have a complex clinical course. A feature of many 
herpes viruses is the ability to go into a latent state within the host for an extended period of time. Viruses 
of the alpha subfamily maintain latent forms in the sensory and autonomic ganglia, whereas those of the 
gamma subfamily maintain latent forms, for example, in cells of the lymphocyte lineage. Latency is 
associated with the transcription of certain viral genes, and may persist for decades until conditions are 
optimal for the virus to resume active replication. Such conditions may include an immunodeficiency. In 
addition, some herpes viruses of the gamma subfamily have the ability to genetically transform the cells 
they infect. For example, EBV is associated with B cell lymphomas, oral hairy leukoplakia, lymphoid 
interstitial pneumonitis, and nasopharyngeal carcinoma. 

A number of other conditions occur in humans and other vertebrates that involve fibroproliferation and 
the generation of pre-neoplastic cells. Examples occurring in humans are retroperitoneal fibrosis, nodular 
fibromatosis, pseudosarcomatous fibromatosis, and sclerosing mesenteritis. Another condition known as 
Enzootic Retroperitoneal Fibromatosis (RF) has been observed in a colony of macaque monkeys at the 
University of Washington Regional Primate Research Center (Giddens et al.). Late stages of the disease 
are characterized by proliferating fibrous tissue around the mesentery and the dorsal part of the peritoneal 
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cavity, with extension into the inguinal canal, through the diaphragm, and into the abdominal wall. Once 
clinically apparent, the disease is invariably fatal within 1-2 months. The condition has been associated 
with simian immunodeficiency (SAIDS) due to a type D simian retrovirus, SRV-2 (Tsai et aL). However, 
other colonies do not show the same frequency of RF amongst monkeys affected with SAIDS, and the 
5 frequency of RF at Washington has been declining in recent years. 

The study of such conditions in non-human primates is important not only as a model for human 
conditions, but also because one primate species may act as a reservoir of viruses that affect another 
species. For example, the herpes virus saimiri appears to cause no disease in its natural host, the squirrel 
monkey (Saimiri sciureus), but it causes polyclonal T-cell lymphomas and acute leukemias in other 
1 0 primates, particularly owl monkeys. 

There is a need to develop reagents and methods for use in the detection and treatment of herpes 
virus infections. 

For example, there is a need to develop reagents and methods which can be used in the diagnosis 
and assessment of Kaposi's sarcoma, and similar conditions. Being able to detect the etiologic agent in a 

1 5 new patient may assist in differential diagnosis; being able to assess the level of the agent in an ongoing 
condition may assist in clinical management. The tegument encoding polynucleotide of Chang et al. may 
have limited applicability in this regard. It is desirable to obtain a marker capable of distinguishing active 
from latent infection. It is also desirable to obtain a marker that is immunogenic, and can be used to 
assess immunological exposure to the agent as manifest in the antibody response. 

20 Second, there is a need to develop reagents and methods which can be used in the development of 

new pharmaceuticals for Kaposi's sarcoma, and similar conditions. The current treatment for Kaposi's is 
radiation in combination with traditional chemotherapy, such as vincristine (Northfelt, Mitsuyasu). While 
lesions respond to these modalities, the response is temporary, and the downward clinical course 
generally resumes. Even experimental therapies, such as treatment with cytokines, are directed at the 

25 symptoms of the disease rather than the cause. Drug screening and rational drug design based upon the 
etiologic agent can be directed towards the long-felt need for a clinical regimen with long-term efficacy. 

Third, there is a need to develop reagents and methods which can be used to identify viral agents 
that may be associated with other fibroproliferative conditions. The representational difference analysis 
technique used by Chang et al. is arduously complex, and probably not appropriate as a general screening 

30 test. More desirable are a set of primers or probes to be used as reagents in more routine assays for 
surveying a variety of tissue samples suspected of containing a related etiologic agent. Preferably, the 
reagents are sufficiently cross-reactive to identify previously undescribed viral compounds, but sufficiently 
specific to avoid identifying unwanted viruses or endogenous components of the host. 

35 Disclosure of the invention 

It is an objective of this invention to provide isolated polynucleotides, polypeptides, and antibodies 
derived from or reactive with the products of novel DNA polymerase genes. The genes are present in 
herpes viruses associated with fibroproliferative conditions and neoplasms, especially those that occur in 
40 humans and non-human primates. Another objective of this invention is to provide polynucleotide primers 
and probes for detecting and characterizing DNA polymerase genes in any member of the herpes virus 
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family, especially the gamma herpes subfamily. Another object of this invention is to provide materials and 
methods based on these polynucleotides, polypeptides, and antibodies for use in the diagnosis and 
treatment of gamma herpes virus infection in primates, particularly humans. 
Embodiments of the invention include the following: 

• An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 
polynucleotide comprising a sequence (preferably 475 nucleotides long) that is at least 69% identical 
to nucleotides 27 to 501 of a sequence selected from the group consisting of SEQ. ID NO:1 and SEQ. 
ID NO:3. 

• An isolated polynucleotide comprising a fragment of at least 18, more preferably at least about 35, still 
more preferably at least about 50 consecutive nucleotides of the DNA polymerase encoding region of 
the polynucleotide of the previous embodiment, wherein the sequence of said fragment is not 
contained in SEQ. ID NOS:110 or 111. Preferred examples are isolated polynucleotides comprising a 
fragment of at least 18 consecutive nucleotides contained in SEQ. ID NOS:1, 3, 1 16, or 118. 

• An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 
polynucleotide comprising a sequence of 26 nucleotides at least 80% identical to oligonucleotide 
LSGGA (SEQ. ID NO: 107). 

• An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 
polynucleotide comprising a sequence of 29 nucleotides at least 69% identical to oligonucleotide 
CTDPA (SEQ. ID NO: 108). 

• An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 
polynucleotide comprising a sequence of 32 nucleotides at least 80% identical to oligonucleotide 
KMLEA (SEQ. ID NO:22). 

• An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 
polynucleotide comprising a sequence of 29 nucleotides at least 69% identical to oligonucleotide 
GISPA(SEQ. ID NO:109). 

• An isolated polynucleotide comprising a fragment of at least 18, more preferably at least about 35, 
more preferably at least about 50 consecutive nucleotides of the DNA polymerase encoding region of 
the polynucleotide of the previous embodiments, wherein the sequence of said fragment is not 
contained in SEQ. ID NOS:1 1 0 or 1 1 1 . 

• The polynucleotide of any of the previous embodiments, wherein said herpes virus is capable of 
infecting primates. Preferred examples are RFHV, KSHV, and RFHV2. 

• An isolated polynucleotide comprising a linear sequence of at least 18 nucleotides identical to a 
linear sequence between nucleotides 27 to 501 inclusive of SEQ. ID NO:1, or between nucleotides 
27 to 501 inclusive of SEQ. ID NO:3, or between nucleotides 36 to 2499 inclusive of SEQ. ID 
NO:116, or between nucleotides 1 to 454 inclusive of SEQ. ID NO:118, but not to a linear sequence 
within either SEQ. ID NO:110 or SEQ. ID NO:111. The isolated polynucleotide preferably 
comprises a linear sequence essentially identical to nucleotides 27 to 501 of SEQ. ID NO:1, or to 
nucleotides 27 to 501 of SEQ. ID NO:3, or to nucleotides 36 to 2499 of SEQ. ID NO:116, or to 
nucleotides 1 to 454 of SEQ. ID NO:118. 

• An isolated polypeptide encoded by any of the polynucleotides embodied in this invention. 
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• An isolated polypeptide, comprising a linear sequence of at least 11, preferably 12, and more 
preferably 15 amino acids essentially identical to a sequence between amino acids 10 to 167 inclusive 
of SEQ. ID NO:2 or between amino acids 10 to 167 inclusive of SEQ. ID NO:4 or between amino 
acids 13 to 833 inclusive of SEQ. ID NO:117, or in any of SEQ. ID NOS:1 19-123, but which is not 

5 contained in SEQ. ID NOS:1 12 or in SEQ. ID NO:1 13. 

• A fusion polypeptide comprising the amino acid sequence of an isolated peptide of the previous 
embodiment, joined to a second amino acid sequence. 

• The isolated polypeptide of the previous embodiment, which has nucleic acid binding activity, 
nucleotide binding activity, or DNA polymerase activity. 

10 • An isolated polypeptide, comprising a linear sequence of amino acids identical to a sequence selected 
from the group consisting of SEQ. ID NOS:80, 82, 84, 86, 88, and 90 to 103. 

• An isolated polynucleotide encoding a polypeptide embodied in this invention. 

• A non-naturally occurring polynucleotide encoding a polypeptide embodied in this invention. 

• A polynucleotide encoding a fusion polypeptide, comprising the polynucleotide embodied in this 
1 5 invention joined directly to a second polynucleotide encoding a polypeptide. 

• A recombinant cloning vector comprising a polynucleotide sequence encoding a polypeptide of at 
least 11, preferably at least 12, more preferably at least 15 consecutive amino acids between amino 
acids 10-167 inclusive of SEQ. ID NO:2, or between amino acids 10-167 inclusive of SEQ. ID NO:4, or 
between amino acids 13-833 inclusive of SEQ. ID NO:117, or in any of SEQ. ID NOS:1 19-123, but not 

20 contained in SEQ. ID NO:112 or SEQ. ID NO:113. 

• A recombinant expression vector comprising a polynucleotide sequence encoding a polypeptide of at 
least 1, preferably at least 12, more preferably at least 15 consecutive amino acids between amino 
acids 10-167 inclusive of SEQ. ID NO:2, or between amino acids 10-167 inclusive of SEQ. ID NO:4, or 
between amino acids 13-833 inclusive of SEQ. ID NO:117, or in any of SEQ. ID NOS:1 19-123, but not 

25 contained in SEQ. ID NO:112 or SEQ. ID NO:113, operatively linked to a control polynucleotide 

sequence. 

• A recombinant cloning vector comprising a linear sequence of at least 18 nucleotides identical to a 
linear sequence within SEQ. ID NOS:1, 3, 116, or118, but not in SEQ. ID NOS:110 or 111. 

• A host cell genetically altered by any of the polynucleotides, cloning vectors, or expression vectors of 
30 this invention. 

• A monoclonal or isolated polyclonal antibody specific for a DNA polymerase encoded in said encoding 
region of a polynucleotide of this invention. 

• A monoclonal or isolated polyclonal antibody specific for a peptide of this invention. 

• An oligonucleotide essentially identical to an oligonucleotide selected from the group consisting of 
35 SEQ. ID NOS:5to 16, 21, 22, 104-109, and 124-152. 

• A method of obtaining an amplified copy of a polynucleotide encoding a DNA polymerase, comprising 
the steps of contacting the polynucleotide with an oligonucleotide of this invention, which may be a 
Type 1 , Type 2, or Type 3 oligonucleotide, and elongating oligonucleotide that has formed a duplex 
with the polynucleotide. Preferably, the amplification reaction is a polymerase chain reaction (PCR). 
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The PCR preferably comprises repeated cycles of annealing and elongating, and the annealing is 
conducted at a temperature of at least 60°C. 

• The PCR is preferably conducted in a buffer containing 10-30 mM (NH 4 ) 2 S0 4 and 1-10 mM MgCI 2 ., 
such as WB4 buffer. 

5 • A method of obtaining an amplified copy of a polynucleotide encoding a DNA polymerase, wherein the 
polynucleotide which is amplified is first obtained from a biological sample taken from an individual 
affected with a disease featuring fibroblast proliferation and collagen deposition. 

• A method of obtaining an amplified copy of a polynucleotide encoding a DNA polymerase, wherein the 
polynucleotide which is amplified is first obtained from a biological sample taken from an individual 

10 affected with a malignancy of the lymphocyte lineage. Also included is a method wherein the 

polynucleotide which is amplified is first obtained from a biological sample taken from an individual 
affected with a condition selected from the group consisting of retroperitoneal fibrosis, nodular 
fibromatosis, pseudosarcomatous fibromatosis, fibrosarcoma, sclerosing mesenteritis, acute 
respiratory disease syndrome, idiopathic pulmonary fibrosis, diffuse proliferative glomerulonephritis, 

15 glioma, glioblastomas, gliosis, leukemia and lymphoma. 

• A method of detecting viral DNA or RNA in a sample of primate origin, comprising the steps of: 
contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of this invention, 
under conditions that would permit the probe to form a stable duplex with a polynucleotide having the 
sequence shown in SEQ. ID NO:1, and with a polynucleotide having the sequence shown in SEQ. ID 

20 NO:3, but not with a polynucleotide having a sequence of any of SEQ. ID NOS:24 to 29; and detecting 

the presence of said stable duplex formed in step a), if any. The conditions referred to are a single set 
of reaction parameters, such as incubation time, temperature, solute concentrations, and washing 
steps, that fulfills all the criteria listed. Under these conditions, the polynucleotide would be capable of 
forming a stable duplex if contacted with a polynucleotide having SEQ. ID NO:1. It would also be 

25 capable of forming a stable duplex if contacted with a polynucleotide having SEQ. ID NO:3. It would 

not be capable of forming a stable duplex if contacted with a polynucleotide having a sequence of any 
of SEQ ID NO:24 to SEQ. ID NO:29. The reaction conditions may optionally be tested by contacting 
with the polynucleotides consisting only of the sequences indicated, or by contacting with 
polynucleotides with the sequences indicated linked to additional nucleotides, so long as formation of 

30 a stable duplex under the test conditions relies on the sequence indicated. Also included are similar 

methods using other polynucleotides of this invention. This includes conducting an amplification 
reaction on the DNA or RNA of the sample prior to being contacted with the probe. The amplification 
reaction may be conducted using an oligonucleotide primer of this invention. 

• A method of detecting viral DNA or RNA in a sample of primate origin, comprising the steps of: 
35 contacting the DNA or RNA in the sample with an oligonucleotide probe comprising a sequence 

shown in SEQ. ID NOS: 21 , 22, 1 07, 1 08, or 1 09, under conditions that would permit the probe to form 
a . stable duplex with a polynucleotide having the sequence shown in SEQ. ID NO:1, and with a 
polynucleotide having the sequence shown in SEQ. ID NO:3, but not with a polynucleotide having a 
sequence of any of SEQ. ID NOS:24 to 29; and detecting the presence of said stable duplex formed, if 
40 any. 
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• A method of detecting viral DNA or RNA in a sample, comprising the steps of: contacting the DNA or 
RNA in the sample with an oligonucleotide probe comprising a sequence shown in SEQ. ID NOS:22, 
107, 108 or 109 under conditions that would permit the probe to form a stable duplex with a 
polynucleotide having the sequence shown in SEQ. ID NO:1, and with a polynucleotide having the 

5 sequence shown in SEQ. ID NO:3, but not with a polynucleotide having a sequence of any of SEQ. ID 

NOS:23 to 29; and detecting the presence of said stable duplex formed f any. 

• A method of detecting viral DNA or RNA in a sample, comprising the steps of: conducting an 
amplification reaction on a polynucleotide in the sample using the oligonucleotide of this invention as a 
primer in the reaction; and detecting the presence of amplified copies of the polynucleotide, if any. 

10 • An isolated polynucleotide capable of forming a stable duplex with an oligonucleotide comprising a 
sequence selected from the group consisting of SEQ. ID NO: 107, SEQ. ID NO: 108, and their 
respective complementary sequences, under conditions wherein the oligonucleotide is capable of 
forming a stable duplex with a polynucleotide having the sequence shown in SEQ. ID NO:1 , and with a 
polynucleotide having the sequence shown in SEQ. ID NO:3, but not with a polynucleotide having a 

15 sequence of any of SEQ. ID NOS:23 to 29.An isolated polypeptide comprising a linear sequence of at 

least 11 amino acids, preferably at least 12 amino acids, more preferably at least 15 amino acids 
encoded within the polynucleotide of the preceding embodiment. 

• A method for detecting infection of an individual by a herpes virus, comprising detecting viral DNA or 
RNA in a biological sample obtained from the individual, wherein the detecting of viral DNA or RNA is 

20 by the method of embodied in this invention. Also included is a method for detecting infection of an 

individual by a herpes virus, comprising detecting viral DNA or RNA in a biological sample obtained 
from the individual, wherein the detecting of viral DNA or RNA is by the method of: a) contacting the 
DNA or RNA in the sample with a probe comprising the polynucleotide of this invention under 
conditions that would permit the probe to form a stable duplex with a polynucleotide having at least 

25 one sequence selected from the group consisting of SEQ. ID NOS:1, 3, 116, or 118, but not with 

polynucleotides having a sequence of any of SEQ. ID NOS:24 to 29; and b) detecting the presence of 
said stable duplex formed in step a), if any. Also included is a method for detecting infection of an 
individual by a herpes virus, comprising detecting viral DNA or RNA in a biological sample obtained 
from the individual, wherein the detecting of viral DNA or RNA is by the method of: a) contacting the 

30 DNA or RNA in the sample with a probe comprising the polynucleotide of this invention under 

conditions that would permit the probe to form a stable duplex with a polynucleotide having a 
sequence shown in SEQ. ID NO:116, but not with polynucleotides having a sequence of any of SEQ. 
ID NOS:24 to 29; and b) detecting the presence of said stable duplex formed in step a), if any. 

• A diagnostic kit for detecting a herpes virus polynucleotide in a biological sample, comprising a 
35 reagent in suitable packaging, wherein the reagent comprises a polynucleotide of this invention 

• A diagnostic kit for detecting a herpes virus polynucleotide in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises the oligonucleotide of this invention. 

• A method of detecting infection of an individual by a herpes virus, comprising the steps of: contacting 
antibody from a sample obtained from the individual with the polypeptide of this invention under 
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conditions that permit the formation of a stable antigen-antibody complex; and detecting said stable 
complexes formed , if any. 

• A diagnostic kit for detecting an anti-herpesvirus antibody present in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises a polypeptide of this invention. 

5 • A method of detecting infection of an individual by a herpes virus, comprising the steps of: contacting 
antibody from a sample obtained from the individual with a polypeptide of this invention under 
conditions that permit the formation of a stable antigen-antibody complex; and detecting said stable 
complexes formed, if any. 

• A diagnostic kit for detecting an anti-herpesvirus antibody present in a biological sample, comprising a 
1 0 reagent in suitable packaging, wherein the reagent comprises a polypeptide of this invention. 

• A method of detecting infection of an individual by a herpes virus, comprising the steps of: contacting 
a polypeptide from a sample obtained from the individual with the antibody of this invention under 
conditions that permit the formation of a stable antigen-antibody complex; and detecting said stable 
complexes formed, if any. 

15 • A diagnostic kit for detecting a herpes virus polypeptide present in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises an antibody of this invention. 

• A composition for use in the treatment of herpes virus infection, comprising a polynucleotide, 
polypeptide, or antibody of this invention. 

• A method of determining whether a pharmaceutical candidate is useful for treating gamma herpes 
20 infection, comprising the steps of: contacting a peptide of this invention with the pharmaceutical 

candidate; and determining whether a biochemical function of the polypeptide is altered by the 
pharmaceutical candidate. The biochemical function of the polypeptide determined may be the 
binding of the polypeptide to a nucleic acid., or DNA polymerase activity. 

• A method of determining whether a pharmaceutical candidate is useful for treating gamma herpes 
25 infection, comprising the steps of: genetically altering a cell using a polynucleotide of this invention, 

and determining the effect of the pharmaceutical candidate on the cell in comparison with a cell not 
genetically altered with the polynucleotide. 

• A method of obtaining a compound for use in treating an individual infected with herpes virus, 
comprising the steps of: creating a compound capable of binding a region of the polypeptide of this 

30 invention involved in interacting with a nucleic acid; and determining whether the compound interferes 

with a biochemical function of the polypeptide. 

Brief Description of the Drawings 

35 Figure 1 is a listing of polynucleotide sequences amplified from a DNA polymerase encoding region 

of RFHV and KSHV, along with the encoded polypeptides. The 475-base fragment of each polynucleotide 
between primers DFASA and GDTD1B is underlined. Also shown in lower-case letters are 
oligonucleotides useful as amplification primers aligned with corresponding regions of the DNA 
polymerase gene. DFASA, VYGA and GDTD1B are oligonucleotides with consensus and degenerate 

40 segments that can be used to amplify any herpes virus DNA polymerase gene. LSGGA, CTDPA, PCLNA, 
KMLEA and GISPA are oligonucleotides specific for the RFHV/KSHV subfamily of herpes viruses. 
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VASGA, ILPCA, PIEAB and PEARB are RFHV-specific primers. SGILA, CLNIA, IEASB and EARFB are 
KSHV-specific primers. Oligonucleotides that initiate amplification in the direction of the coding sequence 
(with designations ending in u A n ) are listed 5'-»3'. Oligonucleotides that initiate amplification in the 
direction opposite to that of the coding sequence (with designations ending in "B") are listed 3'->5\ to show 
5 alignment with the corresponding sequences in the RFHV and KSHV polynucleotide. 

Figure 2 is a listing of the previously known polypeptide sequences of other herpes virus DNA 
polymerases, showing regions that are relatively conserved between species. 

Figure 3 is a listing of previously known polynucleotide sequences of herpes viruses near conserved 
REGION 2, showing the alignment of oligonucleotides DFASA and DFQSA with the sequences from which 
1 0 they were designed. 

Figure 4 is a listing of previously known polynucleotide sequences of herpes viruses near conserved 
REGION 3, showing the alignment of oligonucleotides VYGA, VYGCA and VYGSQA with the sequences 
from which they were designed. 

Figure 5 is a listing of previously known polynucleotide sequences of herpes viruses near conserved 
15 REGION 1, showing the alignment of oligonucleotides GDTD1B and GDTDSQB with the sequences from 
which they were designed. 

Figure 6 is a listing comparing the polynucleotide sequences of DNA polymerase of the gamma 
herpes virus subfamily. The fragment shown is the 475 base pairs between the hybridizing site of DFASA 
and GDTD1B. 

20 Figure 7 is a listing comparing polypeptide sequences of DNA polymerase for the same viruses over 

the same fragment as Figure 6. This figure also shows examples of possible antibody binding regions, 
including those which are specific for RFHV, KSHV, or the RFHV/KSHV subfamily. 

Figure 8 is a comparison of the polypeptide sequence for the fragment encoded between DFASA and 
GDTD1 B across a broader range of herpes viruses. Sequences are shown for herpes viruses of the 
25 alpha, beta, and gamma subfamilies, and for endogenous mammalian DNA polymerase. 

Figure 9 is a relationship map of DNA polymerases, based on polypeptide sequences shown in 
Figure 8. 

Figure 10 is a listing of the DNA polymerase genes for members of the gamma herpes virus 
subfamily over the same region as Figure 6. This Figure shows the alignment of oligonucleotides LSGGA, 
30 CTDPA, PCLNA, KMLEA and GISPA aligned with the sequences from which they were designed. These 
oligonucleotides are specific for DNA polymerase from the RFHV/KSHV virus subfamily. 

Figure 1 1 is a Hopp-Woods antigenicity plot for the polypeptide fragment of RFHV encoded between 
VYGA and GDTD1B. Indicated below are spans of hydrophobic and antigenic residues in the sequence. 
Figure 12 is a Hopp-Woods antigenicity plot for the polypeptide fragment of KSHV encoded between 
35 DFASA and GDTD1 B. indicated below are spans of hydrophobic and antigenic residues in the sequence. 

Figure 13 is a listing of about 2511 nucleotides of the DNA polymerase encoding sequence of KSHV, 
estimated to be about 3000 nucleotides long, along with the amino acid translation. Additional sequence 
data is provided in the 5' and 3' direction from the PCR segment shown in Figure 1 . 

Figure 14 is a listing comparing a portion of the KSHV DNA polymerase amino acid sequence with 
40 that of other herpes viruses. Asterisks (*) and bullets (•) indicate conserved residues or conservative 
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substitutions. Arrows (t) indicate residues that are conserved amongst other herpes viruses, but different 
in the KSHV sequence. 

Figure 15 is a listing showing known variants of the KSHV DNA polymerase amino acid sequence. 

Figure 16 is a listing of polynucleotide sequences amplified from a DNA polymerase encoding region 
of RFHVMm (designated here as RFMm), RFHVMm is a third member of the RFHV/KSHV herpes virus 
subfamily identified according to the criteria of this invention. Shown for comparison are DNA polymerase 
encoding regions of RFHV (designated RFMn) and KSHV. 

Figure 17 is a listing comparing amino acid sequences encoded in a DNA polymerase encoding 
region of RFHVMm with corresponding sequences of RFHVMn, KSHV, and three other herpes viruses. 

Figure 18 is a statistical phylogeneic analysis of the amino acid alignments in Figure 17. The 
numbers shown are bootstrap values out of 100 repetitions. 

Figure 19 is a map showing approximate hybridization positions of Type 1, Type 2, and Type 3 
oligonucleotide probes in the DNA polymerase nucleotide sequences of members of the RFHV/KSHV 
subfamily. 

Figure 20 is a representative screen for the prevalence of RFHVMn and RFHVMm herpesvirus 
sequences in M. nemestrina monkeys (lanes A-D, I, and J), and M. mulatta monkeys (lanes G and H) in a 
nested amplification assay using virus-specific oligonucleotide primers. 

Best Mode for Carrying Out the Invention 

We have discovered and characterized polynucleotides encoding DNA polymerase from RFHV, 
RFHV2, and KSHV, which are exemplary members of the RFHV/KSHV subfamily of herpes viruses. The 
polynucleotides obtained, related polynucleotides, and corresponding polypeptides and antibodies are 
useful in the diagnosis, clinical monitoring, and treatment of herpes virus infections and related conditions. 

Sources for the polynucleotides from RFHV and KSHV were affected tissue samples taken from 
Macaque nemestrina monkeys with retroperitoneal fibromatosis ("RF") and from humans with Kaposi's 
sarcoma ("KS"), respectively. We predicted that these conditions were associated with viruses distinct 
from those responsible for any contemporaneous immunodeficiency. We did not know in advance that the 
RF and KS associated viruses would be related. 

We decided to test the premise that viruses associated with both conditions are members of the 
herpes virus family. Accordingly, we designed oligonucleotides for use in an amplification reaction to 
obtain polynucleotides encoding a DNA polymerase from a broad spectrum of herpes viruses. Comparing 
amino acid sequences of herpes viruses that have been previously described, three conserved regions 
were identified. The corresponding known polynucleotide sequences were used to construct 
oligonucleotides comprising a degenerate segment and a consensus segment. These oligonucleotides 
served as primers in amplification reactions that yielded fragments of the DNA polymerase encoding 
segment from each of the two tissue sources. 

The sequences of the polynucleotide fragments obtained from the final step of the amplification 
reactions are shown in Figure 1 (SEQ. ID NO:1 and SEQ. ID NO:3, respectively). Both sequences are 
novel, although they contain regions that are highly homologous to regions of DNA polymerase sequences 
from other herpes viruses. The virus infecting the M. nemestrina monkeys was designated 



WO 97/04105 



PCT/US96/11688 



"Retroperitoneal Fibromatosis Herpes Virus" ("RFHV"). The virus infecting the human patients was 
designated "Kaposi's Sarcoma Herpes Virus" ( U KSHV). The polynucleotide sequences shown include 
segments at each end corresponding to the hybridizing regions of the DFASA and GDTD1 B primers used 
in the amplification. The 475 base pair fragment between the primers represents an amplified portion of 
5 the DNA polymerase gene for RFHV and KSHV. 

Since the primers were designed to amplify a broad spectrum of DNA polymerases, we were 
surprised to find that these two DNA polymerase sequences are apparently more closely related to each 
other (71% identity at the nucleotide level) than to any other known herpes virus DNA polymerase. The 
next most closely related polynucleotide sequences are from equine herpes virus 2 (eHV2), saimiri herpes 

10 virus 1 (sHV1), and Epstein Barr virus (EBV). We therefore predict that both RFHV and KSHV are 
members of the herpes gamma subfamily. RFHV and KSHV share with other gamma herpes an 
association with abnormal cellular or fibrotic growth, and an association with immune abnormalities, 
including immunosuppression and B cell dysplasias. However, RFHV and KSHV DNA polymerase 
sequences differ from sHV1 and EBV in the frequency of CpG dinucleotides. RFHV and KSHV DNA 

15 polymerase nucleotide sequences and oligonucleotides based upon them define the RFHV/KSHV 
subfamily as described below. The DNA polymerase sequence of a third member of the subfamily 
infecting M. mulatta monkeys, RFHV2, is also provided. 

The degree of conservation between DNA polymerases means that the polynucleotides and 
polypeptides embodied in this invention are reliable markers amongst different strains of RFHV and KSHV. 

20 Because it is a sequestered antigen, DNA polymerase is not under the same degree of immunological 
pressure to form escape mutants. Furthermore, the sequences are constrained by the critical role that 
these regions play in the catalytic activity of the DNA polymerase. Thus, the polynucleotides, 
polypeptides, and antibodies embodied in this invention are useful in such applications as the detection of 
viral infection in an individual, due to RFHV, KSHV, or other herpes viruses that are of the same subfamily. 

25 Embodiments of the invention are also useful in the characterization of herpes virus DNA polymerase, and 
the design of pharmaceutical therapies. 

Because the DNA polymerase plays a critical role in viral replication, it is an appropriate target for 
pharmacological intervention. Particularly sensitive regions of the molecule are those involved in substrate 
recognition, template binding, catalysis, and association with regulatory subunits. 

30 Polynucleotides of the RFHV/KSHV subfamily, related oligonucleotide probes and primers, related 

polypeptides and antigens, related specific antibodies, the preparation and use of these compounds, and 
related methods and products are described in further detail in the sections that follow. 

Abbreviations 

35 

The following abbreviations are used herein to refer to species of herpes viruses, and polynucleotides 
and genes derived therefrom that encode DNA polymerase: 



-11 - 



WO 97/04105 



PCT/US96/11688 



TABLE 1: Abbreviations for Herpes Virus Strains 




lllliilllllll 






simian Retroperitoneal Fibromatosis- 

associated HeroesVim^ 


gamma-HerpesVirus 


KSHV 


human Kaposi's Sarcoma-associated 
Herpesvirus 




§|§§ 


equine Herpesvirus 2 






saimiri monkey Herpesvirus 1 






human Epstein-Barr Virus 






human CytoMegaloVirus 


beta-HerpesVirus 




murine CytoMegaloVirus 






guinea pig CytoMegaloVirus 






human Herpesvirus 6 




HVZV 


human Varicella-Zoster Virus 


alpha-HerpesVirus 


HHSV1 : 


human Heroes SimDlex Virus 1 

* ■ » Wii 9 • 9 W 1 WWW \m* 1 I 9 lk/1 WAV VII UW 1 






human Herpes Simplex Virus 2 






equine Herpesvirus 1 




IHV1 


ictalurid catfish Herpesvirus 




ttPQLd 


human endogenous DNA polymerase 


eukaryotic delta DNA 
polymerase 




bovine endogenous DNA polymerase 





Definitions 



"RFHV" and "KSHV" are viruses of the herpes family detected in tissue samples of infected macaque 
5 nemestrina monkeys and humans, respectively. Cells infected with these viruses contain polynucleotides 
encoding the respective DNA polymerases as described herein. "RFHNT is synonymous with the terms 
"RFHV1", "RFHVMn", and "RFMn". A third member of the RFHV/KSHV subfamily is a virus identified in a 
M. mulatta monkey. The virus is referred to herein as "RFHV2". "RFHV2" is synonymous with the terms 
"RFHVMm" and "RFMm". 

10 The "RFHV/KSHV subfamily" is a term used herein to refer to a collection of herpes viruses capable 

of infecting vertebrate species. The subfamily consists of members that have sequences that are more 
closely related to the corresponding sequences of RFHV or KSHV than either of these viruses are to any 
other virus listed in Table 1 . The sequence comparison may be made at either the polynucleotide or the 
polypeptide level, and may be across intact genes or proteins, or across fragments thereof. As used 

15 herein, the subfamily refers to herpes viruses that contain a portion of a DNA-polymerase-encoding 
polynucleotide that is more closely identical to the corresponding region of RFHV or KSHV than either of 
these viruses are to the viruses in Table 1. Preferably, the polynucleotide encoding the polymerase 
comprises a segment that is at least 69% identical to that of RFHV (SEQ. ID NO:1) or KSHV (SEQ. ID 
NO:3) between residues 27 and 501; or at least 80% identical to the oligonucleotide LSGGA; or at least 
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69% identical to the oligonucleotide CTDPA; or at least 80% identical to the oligonucleotide KMLEA; or at 
least 69% identical to the oligonucleotide GISPA. 

As used herein, a a DNA polymerase" is a protein or a protein analog, that under appropriate 
conditions is capable of catalyzing the assembly of a DNA polynucleotide with a sequence that is 
5 complementary to a polynucleotide used as a template. A DNA polymerase may also have other catalytic 
activities, such as 3'-5' exonuclease activity; any of the activities may predominate. A DNA polymerase 
may require association with additional proteins or co-factors in order to exercise its catalytic function. 
"DNA polymerase activity" refers to the catalytic activity directed at DNA polynucleotide assembly. A "DNA 
polymerase reaction" is any step in a reaction mechanism on the pathway to polymerization of nucleotides, 

10 including association with substrates, cofactors, and regulatory subunits, the formation of intermediates, 
and the formation of reaction products. 

The term "DNA polymerase gene" includes any gene that encodes a polypeptide capable of a DNA 
polymerase reaction. It also includes any gene that is believed to be derived from an ancestral gene that 
encoded a DNA polymerase, because of homology with other DNA polymerase genes or its location 

1 5 relative to neighboring genes; such a gene may encode a non-functional DNA polymerase analog, a DNA 
polymerase fragment or mutant, or it may be untranscribed or untranslated. 

A "regulatory subunit" for a first polypeptide that has DNA polymerase activity is a second polypeptide 
that regulates DNA polymerase activity of the first polypeptide when associated with it. UL42 is an 
example of a regulatory subunit. 

20 "UL42" or "UL42 subunit" is an accessory protein that is encoded in the genome of some herpes 

viruses. It is capable of associating with the DNA polymerase of the virus. Under certain conditions, it 
enhances the DNA polymerase activity of a polypeptide encoded by a DNA polymerase gene, and may be 
required for the virus to replicate. As used herein, the definition is a functional one, and does not depend 
on the structure or genomic location of the corresponding gene. Thus, a UL42 subunit of RFHV or KSHV 

25 may have a sequence that is not essentially identical to the UL42 subunit of other viruses. 

The terms "polynucleotide" and "oligonucleotide" are used interchangeably, and refer to a polymeric 
form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. 
Polynucleotides may have any three-dimensional structure, and may perform any function, known or 
unknown. The following are non-limiting examples of polynucleotides: a gene or gene fragment, exons, 

30 introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant 
polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated 
RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified 
nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the 
nucleotide structure may be imparted before or after assembly of the polymer. The sequence of 

35 nucleotides may be interrupted by non-nucleotide components. A polynucleotide may be further modified 
after polymerization, such as by conjugation with a labeling component. 

The term polynucleotide, as used herein, refers interchangeably to double- and single-stranded 
molecules. Unless otherwise specified or required, any embodiment of the invention described herein that 
is a polynucleotide encompasses both the double-stranded form and each of two complementary 

40 single-stranded forms known or predicted to make up the double-stranded form. 
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In the context of polynucleotides, a "linear sequence" or a "sequence" is an order of nucleotides in a 
polynucleotide in a 5' to 3' direction in which residues that neighbor each other in the sequence are 
contiguous in the primary structure of the polynucleotide. A "partial sequence" is a linear sequence of part 
of a polynucleotide which is known to comprise additional residues in one or both directions. 
5 "Hybridization" refers to a reaction in which one or more polynucleotides react to form a complex that 

is stabilized via hydrogen bonding between the bases of the nucleotide residues. The hydrogen bonding 
may occur by Watson-Crick base pairing, Hoogsteen binding, or in any other sequence-specific manner. 
The complex may comprise two strands forming a duplex structure, three or more strands forming a 
multi-stranded complex, a single self-hybridizing strand, or any combination of these. A hybridization 

10 reaction may constitute a step in a more extensive process, such as the initiation of a PCR, or the 
enzymatic cleavage of a polynucleotide by a ribozyme. 

Hybridization reactions can be performed under conditions of different "stringency". Conditions that 
increase the stringency of a hybridization reaction are widely known and published in the art: see, for 
example, Sambrook Fritsch & Maniatis. Examples of relevant conditions include (in order of increasing 

15 stringency): incubation temperatures of 25°C, 37°C, 50°C, and 68°C; buffer concentrations of 10 x SSC, 6 
x SSC, 1 x SSC, 0.1 x SSC (where SSC is 0.15 M NaCI and 15 mM citrate buffer) and their equivalent 
using other buffer systems; formamide concentrations of 0%, 25%, 50%, and 75%; incubation times from 5 
min to 24 h; and washes of increasing duration, increasing frequency, or decreasing buffer concentrations. 
"T m " is the temperature in degrees Centigrade at which 50% of a polynucleotide duplex made of 

20 complementary strands hydrogen bonded in an antiparallel direction by Watson-Crick base paring 
dissociates into single strands under the conditions of the experiment. T m may be predicted according to 
standard formula; for example: 

T m = 81 .5 + 16.6 log [Na + ] + 0.41 (%G/C) - 0.61 (%F) - 600/L 

25 

where Na + is the cation concentration (usually sodium ion) in mol/L; (%G/C) is the number of G and C 
residues as a percentage of total residues in the duplex; (%F) is the percent formamide in solution (wt/vol); 
and L is the number of nucleotides in each strand of the duplex. 

A "stable duplex" of polynucleotides, or a "stable complex" formed between any two or more 

30 components in a biochemical reaction, refers to a duplex or complex that is sufficiently long-lasting to 
persist between the formation of the duplex or complex, and its subsequent detection. The duplex or 
complex must be able to withstand whatever conditions exist or are introduced between the moment of 
formation and the moment of detection, these conditions being a function of the assay or reaction which is 
being performed. Intervening conditions which may optionally be present and which may dislodge a 

35 duplex or complex include washing, heating, adding additional solutes or solvents to the reaction mixture 
(such as denaturants), and competing with additional reacting species. Stable duplexes or complexes 
may be irreversible or reversible, but must meet the other requirements of this definition. Thus, a transient 
complex may form in a reaction mixture, but it does not constitute a stable complex if it dissociates 
spontaneously or as a result of a newly imposed condition or manipulation introduced before detection. 
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When stable duplexes form in an antiparallel configuration between two single-stranded 
polynucleotides, particularly under conditions of high stringency, the strands are essentially 
"complementary". A double-stranded polynucleotide can be "comple-mentary" to another polynucleotide, if 
a stable duplex can form between one of the strands of the first polynucleotide and the second. A 
5 complementary sequence predicted from the sequence of a single stranded polynucleotide is the optimum 
sequence of standard nucleotides expected to form hydrogen bonding with the single-stranded 
polynucleotide according to generally accepted base-pairing rules. 

A "sense" strand and an "antisense" strand when used in the same context refer to single-stranded 
polynucleotides which are complementary to each other. They may be opposing strands of a 

10 double-stranded polynucleotide, or one strand may be predicted from the other according to generally 
accepted base-pairing rules. Unless otherwise specified or implied, the assignment of one or the other 
strand as "sense" or "antisense" is arbitrary. 

A linear sequence of nucleotides is "identical" to another linear sequence, if the order of nucleotides 
in each sequence is the same, and occurs without substitution, deletion, or material substitution. It is 

15 understood that purine and pyrimidine nitrogenous bases with similar structures can be functionally 
equivalent in terms of Watson-Crick base-pairing; and the inter-substitution of like nitrogenous bases, 
particularly uracil and thymine, or the modification of nitrogenous bases, such as by methylation, does not 
constitute a material substitution. An RNA and a DNA polynucleotide have identical sequences when the 
sequence for the RNA reflects the order of nitrogenous bases in the polyribonucleotide, the sequence for 

20 the DNA reflects the order of nitrogenous bases in the polydeoxyribonucleotide, and the two sequences 
satisfy the other requirements of this definition. Where at least one of the sequences is a degenerate 
oligonucleotide comprising an ambiguous residue, the two sequences are identical if at least one of the 
alternative forms of the degenerate oligonucleotide is identical to the sequence with which it is being 
compared. For example, AYAAA is identical to ATAAA, if AYAAA is a mixture of ATAAA and ACAAA. 

25 When comparison is made between polynucleotides, it is implicitly understood that complementary 

strands are easily generated, and the sense or antisense strand is selected or predicted that maximizes 
the degree of identity between the polynucleotides being compared. For example, where one or both of 
the polynucleotides being compared is double-stranded, the sequences are identical if one strand of the 
first polynucleotide is identical with one strand of the second polynucleotide. Similarly, when a 

30 polynucleotide probe is described as identical to its target, it is understood that it is the complementary 
strand of the target that participates in the hybridization reaction between the probe and the target. 

A linear sequence of nucleotides is "essentially identical" to another linear sequence, if both 
sequences are capable of hybridizing to form duplexes with the same complementary polynucleotide. 
Sequences that hybridize under conditions of greater stringency are more preferred. It is understood that 

35 hybridization reactions can accommodate insertions, deletions, and substitutions in the nucleotide 
sequence. Thus, linear sequences of nucleotides can be essentially identical even if some of the 
nucleotide residues do not precisely correspond or align. Sequences that correspond or align more closely 
to the invention disclosed herein are comparably more preferred. Generally, a polynucleotide region of 
about 25 residues is essentially identical to another region, if the sequences are at least about 80% 

40 identical; more preferably, they are at least about 90% identical; more preferably, they are at least about 
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95% identical; still more preferably, the sequences are 100% identical. A polynucleotide region of 40 
residues or more will be essentially identical to another region, after alignment of homologous portions if 
the sequences are at least about 75% identical; more preferably, they are at least about 80% identical; 
more preferably, they are at least about 85% identical; even more preferably, they are at least about 90% 
5 identical; still more preferably, the sequences are 1 00% identical. 

In determining whether polynucleotide sequences are essentially identical, a sequence that 
preserves the functionality of the polynucleotide with which it is being compared is particularly preferred. 
Functionality can be determined by different parameters. For example, if the polynucleotide is to be used 
in reactions that involve hybridizing with another polynucleotide, then preferred sequences are those which 

10 hybridize to the same target under similar conditions. In general, the T m of a DNA duplex decreases by 
about 1°C for every 1% decrease in sequence identity for duplexes of 200 or more residues; or by about 
5°C for duplexes of less than 40 residues, depending on the position of the mismatched residues (see, 
e.g., Meinkoth et al.). Essentially identical sequences of about 100 residues will generally form a stable 
duplex with each other's respective complementary sequence at about 20°C less than T m ; preferably, they 

15 will form a stable duplex at about 15°C less; more preferably, they will form a stable duplex at about 10°C 
less; even more preferably, they will form a stable duplex at about 5°C less; still more preferably, they will 
form a stable duplex at about T m . In another example, if the polypeptide encoded by the polynucleotide is 
an important part of its functionality, then preferred sequences are those which encode identical or 
essentially identical polypeptides. Thus, nucleotide differences which cause a conservative amino acid 

20 substitution are preferred over those which cause a non-conservative substitution, nucleotide differences 
which do not alter the amino acid sequence are more preferred, while identical nucleotides are even more 
preferred. Insertions or deletions in the polynucleotide that result in insertions or deletions in the 
polypeptide are preferred over those that result in the down-stream coding region being rendered out of 
phase; polynucleotide sequences comprising no insertions or deletions are even more preferred. The 

25 relative importance of hybridization properties and the encoded polypeptide sequence of a polynucleotide 
depends on the application of the invention. 

A polynucleotide has the same "characteristics" of another polynucleotide if both are capable of 
forming a stable duplex with a particular third polynucleotide under similar conditions of maximal 
stringency. Preferably, in addition to similar hybridization properties, the polynucleotides also encode 

30 essentially identical polypeptides. 

"Conserved" residues of a polynucleotide sequence are those residues which occur unaltered in the 
same position of two or more related sequences being compared. Residues that are relatively conserved 
are those that are conserved amongst more related sequences than residues appearing elsewhere in the 
sequences. 

35 "Related" polynucleotides are polynucleotides that share a significant proportion of identical residues. 

As used herein, a "degenerate" oligonucleotide sequence is a designed sequence derived from at 
least two related originating polynucleotide sequences as follows: the residues that are conserved in the 
originating sequences are preserved in the degenerate sequence, while residues that are not conserved in 
the originating sequences may be provided as several alternatives in the degenerate sequence. For 

40 example, the degenerate sequence AYASA may be designed from originating sequences ATACA and 



-16- 



WO 97/04105 



PCT/US96/11688 



ACAGA, where Y is C or T and S is C or G. Y and S are examples of "ambiguous" residues. A 
degenerate segment is a segment of a polynucleotide containing a degenerate sequence. 

It is understood that a synthetic oligonucleotide comprising a degenerate sequence is actually a 
mixture of closely related oligonucleotides sharing an identical sequence, except at the ambiguous 
5 positions. Such an oligonucleotide is usually synthesized as a mixture of all possible combinations of 
nucleotides at the ambiguous positions. Each of the oligonucleotides in the mixture is referred to as an 
"alternative form". The number of forms in the mixture is equal to 



10 



n 

n 



i=l 



where kj is the number of alternative nucleotides allowed at each position. 

As used herein, a "consensus" oligonucleotide sequence is a designed sequence derived from at 

1 5 least two related originating polynucleotide sequences as follows: the residues that are conserved in all 
originating sequences are preserved in the consensus sequence; while at positions where residues are 
not conserved, one alternative is chosen from amongst the originating sequences. In general, the 
nucleotide chosen is the one which occurs in the greatest frequency in the originating sequences. For 
example, the consensus sequence AAAAA may be designed from originating sequences CAAAA, 

20 AAGAA, and AAAAT. A consensus segment is a segment of a polynucleotide containing a consensus 
sequence. 

A polynucleotide "fragment" or "insert" as used herein generally represents a sub-region of the 
full-length form, but the entire full-length polynucleotide may also be included. 

Different polynucleotides "correspond" to each other if one is ultimately derived from another. For 
25 example, messenger RNA corresponds to the gene from which it is transcribed. cDNA corresponds to 
the RNA from which it has been produced, such as by a reverse transcription reaction, or by chemical 
synthesis of a DNA based upon knowledge of the RNA sequence. cDNA also corresponds to the gene 
that encodes the RNA. Polynucleotides also "correspond" to each other if they serve a similar function, 
such as encoding a related polypeptide, in different species, strains or variants that are being 
30 compared. 

A "probe" when used in the context of polynucleotide manipulation refers to an oligonucleotide 
which is provided as a reagent to detect a target potentially present in a sample of interest by 
hybridizing with the target. Usually, a probe will comprise a label or a means by which a label can be 
attached, either before or subsequent to the hybridization reaction. Suitable labels include, but are not 
35 limited to radioisotopes, fluorochromes, chemiluminescent compounds, dyes, and proteins, including 
enzymes. 

A "primer" is an oligonucleotide, generally with a free 3' -OH group, that binds to a target 
potentially present in a sample of interest by hybridizing with the target, and thereafter promotes 
polymerization of a polynucleotide complementary to the target. 
40 Processes of producing replicate copies of the same polynucleotide, such as PCR or gene cloning, 

are collectively referred to herein as "amplification" or "replication". For example, single or double- 
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stranded DNA may be replicated to form another DNA with the same sequence. RNA may be 
replicated, for example, by an RNA-directed RNA polymerase, or by reverse-transcribing the DNA and 
then performing a PCR. In the latter case, the amplified copy of the RNA is a DNA with the identical 
sequence. 

5 A "polymerase chain reaction" ("PCR") is a reaction in which replicate copies are made of a target 

polynucleotide using one or more primers, and a catalyst of polymerization, such as a reverse 
transcriptase or a DNA polymerase, and particularly a thermally stable polymerase enzyme. Generally, 
a PCR involves reiteratively forming three steps: "annealing", in which the temperature is adjusted such 
that oligonucleotide primers are permitted to form a duplex with the polynucleotide to be amplified; 

10 "elongating", in which the temperature is adjusted such that oligonucleotides that have formed a duplex 
are elongated with a DNA polymerase, using the polynucleotide to which they've formed the duplex as 
a template; and "melting", in which the temperature is adjusted such that the polynucleotide and 
elongated oligonucleotides dissociate. The cycle is then repeated until the desired amount of amplified 
polynucleotide is obtained. Methods for PCR are taught in U.S. Patent Nos. 4,683,195 (Mullis) and 

15 4,683,202 (Mullis et al.). 

Elements within a gene include but are not limited to promoter regions, enhancer regions, 
repressor binding regions, transcription initiation sites, ribosome binding sites, translation initiation sites, 
protein encoding regions, introns and exons, and termination sites for transcription and translation. 

A "control element" or "control sequence" is a nucleotide sequence involved in an interaction of 

20 molecules that contributes to the functional regulation of a polynucleotide, including replication, 
duplication, transcription, splicing, translation, or degradation of the polynucleotide. The regulation may 
affect the frequency, speed, or specificity of the process, and may be enhancing or inhibitory in nature. 
Control elements are known in the art. For example, a "promoter" is an example of a control element. 
A promoter is a DNA region capable under certain conditions of binding RNA polymerase and initiating 

25 transcription of a coding region located downstream (in the 3' direction) from the promoter. 

"Operatively linked" refers to a juxtaposition of genetic elements, wherein the elements are in a 
relationship permitting them to operate in the expected manner. For instance, a promoter is operatively 
linked to a coding region if the promoter helps initiate transcription of the coding sequence. There may 
be intervening residues between the promoter and coding region so long as this functional relationship 

30 is maintained. 

The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to 
polymers of amino acids of any length. The polymer may be linear or branched, it may comprise 
modified amino acids, and it may be interrupted by non-amino acids. The terms also encompass an 
amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, 
35 lipidation, acetylation, phosphorylation, or any other manipulation, such as conjugation with a labeling 
component. 

In the context of polypeptides, a "linear sequence" or a "sequence" is an order of amino acids in a 
polypeptide in an N-terminal to C-terminal direction in which residues that neighbor each other in the 
sequence are contiguous in the primary structure of the polypeptide. A "partial sequence" is a linear 
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sequence of part of a polypeptide which is known to comprise additional residues in one or both 
directions. 

A linear sequence of amino acids is "essentially identical" to another sequence if the two 
sequences have a substantial degree of sequence identity. It is understood that the folding and the 
5 biochemical function of proteins can accommodate insertions, deletions, and substitutions in the amino 
acid sequence. Thus, linear sequences of amino acids can be essentially identical even if some of the 
residues do not precisely correspond or align. Sequences that correspond or align more closely to the 
invention disclosed herein are more preferred. It is also understood that some amino acid substitutions 
are more easily tolerated. For example, substitution of an amino acid with hydrophobic side chains, 

10 aromatic side chains, polar side chains, side chains with a positive or negative charge, or side chains 
comprising two or fewer carbon atoms, by another amino acid with a side chain of like properties can 
occur without disturbing the essential identity of the two sequences. Methods for determining 
homologous regions and scoring the degree of homology are well known in the art; see for example 
Altschul et al. and Henikoff et al. Well-tolerated sequence differences are referred to as "conservative 

1 5 substitutions". Thus, sequences with conservative substitutions are preferred over those with other 
substitutions in the same positions; sequences with identical residues at the same positions are still 
more preferred. Generally, a polypeptide region of about 25 residues is essentially identical to another 
region if the sequences are at least about 80% identical; more preferably, they are at least about 85% 
identical; more preferably, they are at least about 90% identical; more preferably, they are at least about 

20 95% identical; still more preferably, the sequences are 100% identical. A polypeptide region of 40 
residues or more will be essentially identical to another region, after alignment of homologous portions, 
if the sequences are at least about 70% identical; more preferably, they are at least about 70% 
identical, and comprise at least another 10% which are either identical or are conservative substitutions; 
more preferably, they are at least about 80% identical; more preferably, they are at least about 80% 

25 identical, and comprise at least another 10% which are either identical or are conservative substitutions; 
more preferably, they are at least about 90% identical; still more preferably, the sequences are 100% 
identical. 

In determining whether polypeptide sequences are essentially identical, a sequence that preserves 
the functionality of the polypeptide with which it is being compared is particularly preferred. Functionality 

30 may be established by different parameters, such as enzymatic activity, the binding rate or affinity in a 
substrate-enzyme or receptor-ligand interaction, the binding affinity with an antibody, and X-^ray 
crystallographic structure. 

A polypeptide has the same "characteristics" of another polypeptide if it displays the same 
biochemical function, such as enzyme activity, ligand binding, or antibody reactivity. Preferred 

35 characteristics of a polypeptide related to a DNA polymerase or a DNA polymerase fragment are DNA 
polymerase activity, DNA template binding, and the binding of deoxyribonucieotide triphosphates. Also 
preferred is a polypeptide that displays the same biochemical function as the polypeptide with which it x 
is being compared, and in addition, is believed to have a similar three-dimensional conformation, as 
predicted by computer modeling or determined by such techniques as X-ray crystallography. 
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The "biochemical function" or "biological activity" of a polypeptide includes any feature of the 
polypeptide detectable by suitable experimental investigation. "Altered" biochemical function can refer 
to a change in the primary, secondary, tertiary, or quaternary structure of the polypeptide; detectable, 
for example, by molecular weight determination, circular dichroism, antibody binding, difference 
5 spectroscopy, or nuclear magnetic resonance. It can also refer to a change in reactivity, such as the 
ability to catalyze a certain reaction, or the ability to bind a cofactor, substrate, inhibitor, drug, hapten, or 
other polypeptide. A substance may be said to "interfere" with the biochemical function of a polypeptide 
if it alters the biochemical function of the polypeptide in any of these ways. 

A "fusion polypeptide" is a polypeptide comprising regions in a different position in the sequence 
10 than occurs in nature. The regions may normally exist in separate proteins and are brought together in 
the fusion polypeptide; or they may normally exist in the same protein but are placed in a new 
arrangement in the fusion polypeptide. A fusion polypeptide may be created, for example, by chemical 
synthesis, or by creating and translating a polynucleotide in which the peptide regions are encoded in 
the desired relationship. 

15 An "antibody" (interchangeably used in plural form) is an immunoglobulin molecule capable of 

specific binding to a target, such as a polypeptide, through at least one antigen recognition site, located 
in the variable region of the immunoglobulin molecule. As used herein, the term encompasses not only 
intact antibodies, but also fragments thereof, mutants thereof, fusion proteins, humanized antibodies, 
and any other modified configuration of the immunoglobulin molecule that comprises an antigen 

20 recognition site of the required specificity. 

"Immunological recognition" or "immunological reactivity" refers to the specific binding of a target 
through at least one antigen recognition site in an immunoglobulin or a related molecule, such as a B 
cell receptor or a T cell receptor. 

The term "antigen" refers to the target molecule that is specifically bound by an antibody through 

25 its antigen recognition site. The antigen may, but need not be chemically related to the immunogen that 
stimulated production of the antibody. The antigen may be polyvalent, or it may be a monovalent 
hapten. Examples of kinds of antigens that can be recognized by antibodies include polypeptides, 
polynucleotides, other antibody molecules, oligosaccharides, complex lipids, drugs, and chemicals. 
An "immunogen" is an antigen capable of stimulating production of an antibody when injected into 

30 a suitable host, usually a mammal. Compounds may be rendered immunogenic by many techniques 
known in the art, including crosslinking or conjugating with a carrier to increase valency, mixing with a 
mitogen to increase the immune response, and combining with an adjuvant to enhance presentation. 

A "vaccine" is a pharmaceutical preparation for human or animal use, which is administered with 
the intention of conferring the recipient with a degree of specific immunological reactivity against a 

35 particular target, or group of targets. The immunological reactivity may be antibodies or cells 
(particularly B cells, plasma cells, T helper cells, and cytotoxic T lymphocytes, and their precursors) that 
are immunologically reactive against the target, or any combination thereof. Possible targets include 
foreign or pathological compounds, such as an exogenous protein, a pathogenic virus, or an antigen 
expressed by a cancer cell. The immunological reactivity may be desired for experimental 
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purposes, for the treatment of a particular condition, for the elimination of a particular substance, or for 
prophylaxis against a particular condition or substance. 

A "passive vaccine" is a vaccine that does not require participation of the recipient's immune 
response to exert its effect. Usually, it is comprised of antibody molecules reactive against the target. 
5 The antibodies may be obtained from a donor subject and sufficiently purified for administration to the 
recipient, or they may be produced in vitro, for example, from a culture of hybridoma cells, or by 
genetically engineering a polynucleotide encoding an antibody molecule. 

An "active vaccine" is a vaccine administered with the intention of eliciting a specific immune 
response within the recipient, that in turn has the desired immunological reactivity against the target. 
10 An active vaccine comprises a suitable immunogen. The immune response that is desired may be 
either humoral or cellular, systemic or secretory, or any combination of these. 

A "reagent" polynucleotide, polypeptide, or antibody, is a substance provided for a reaction, the 
substance having some known and desirable parameters for the reaction. A reaction mixture may also 
contain a "target", such as a polynucleotide, antibody, or polypeptide that the reagent is capable of 
15 reacting with. For example, in some types of diagnostic tests, the amount of the target in a sample is 
determined by adding a reagent, allowing the reagent and target to react, and measuring the amount of 
reaction product. In the context of clinical management, a "target" may also be a cell, collection of cells, 
tissue, or organ that is the object of an administered substance, such as a pharmaceutical compound. 

An "isolated" polynucleotide, polypeptide, protein, antibody, or other substance refers to a 
20 preparation of the substance devoid of at least some of the other components that may also be present 
where the substance or a similar substance naturally occurs or is initially obtained from. Thus, for 
example, an isolated substance may be prepared by using a purification technique to enrich it from a 
source mixture. Enrichment can be measured on an absolute basis, such as weight per volume of 
solution, or it can be measured in relation to a second, potentially interfering substance present in the 
25 source mixture. Increasing enrichments of the embodiments of this invention are increasingly more 
preferred. Thus, for example, a 2-fold enrichment is preferred, 10-fold enrichment is more preferred, 
100-fold enrichment is more preferred, 1000-fold enrichment is even more preferred. A substance can 
also be provided in an isolated state by a process of artificial assembly, such as by chemical synthesis 
or recombinant expression. 

30 A polynucleotide used in a reaction, such as a probe used in a hybridization reaction, a primer 

used in a PCR, or a polynucleotide present in a pharmaceutical preparation, is referred to as "specific" 
or "selective" if it hybridizes or reacts with the intended target more frequently, more rapidly, or with 
greater duration than it does with alternative substances. Similarly, a polypeptide is referred to as 
"specific" or "selective" if it binds an intended target, such as a ligand, hapten, substrate, antibody, or 

35 other polypeptide more frequently, more rapidly, or with greater duration than it does to alternative 
substances. An antibody is referred to as "specific" or "selective" if it binds via at least one antigen 
recognition site to the intended target more frequently, more rapidly, or with greater duration than it 
does to alternative substances. A polynucleotide, polypeptide, or antibody is said to "selectively inhibit" 
or "selectively interfere with" a reaction if it inhibits or interferes with the reaction between particular 
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substrates to a greater degree or for a greater duration than it does with the reaction between 
alternative substrates. 

A "pharmaceutical candidate" or "drug candidate" is a compound believed to have therapeutic 
potential, that is to be tested for efficacy. The "screening" of a pharmaceutical candidate refers to 
5 conducting an assay that is capable of evaluating the efficacy and/or specificity of the candidate. In this 
context, "efficacy" refers to the ability of the candidate to affect the cell or organism it is administered to 
in a beneficial way: for example, the limitation of the pathology due to an invasive virus. 

The "effector component" of a pharmaceutical preparation is a component which modifies target 
cells by altering their function in a desirable way when administered to a subject bearing the cells. 

10 Some advanced pharmaceutical preparations also have a "targeting component", such as an antibody, 
which helps deliver the effector component more efficaciously to the target site. Depending on the 
desired action, the effector component may have any one of a number of modes of action. For 
example, it may restore or enhance a normal function of a cell, it may eliminate or suppress an 
abnormal function of a cell, or it may alter a cell's phenotype. Alternatively, it may kill or render dormant 

15 a cell with pathological features, such as a virally infected cell. Examples of effector components are 
provided in a later section. 

A "cell line" or "ceil culture" denotes higher eukaryotic cells grown or maintained in vitro. It is 
understood that the descendants of a cell may not be completely identical (either morphologically, 
genotypically, or phenotypically) to the parent cell. 

20 A "host cell" is a cell which has been transformed, or is capable of being transformed, by 

administration of an exogenous polynucleotide. A "host cell" includes progeny of the original 
transformant. 

"Genetic alteration" refers to a process wherein a genetic element is introduced into a cell other than 
by natural cell division. The element may be heterologous to the cell, or it may be an additional copy or 

25 improved version of an element already present in the cell. Genetic alteration may be effected, for 
example, by transfecting a cell with a recombinant plasmid or other polynucleotide through any process 
known in the art, such as electroporation, calcium phosphate precipitation, contacting with a 
polynucleotide-liposome complex, or by transduction or infection with a DNA or RNA virus or viral vector. 
The alteration is preferably but not necessarily inheritable by progeny of the altered cell. 

30 An "individual" refers to vertebrates, particularly members of a mammalian species, and includes 

but is not limited to domestic animals, sports animals, and primates, including humans. 

The term "primate" as used herein refers to any member of the highest order of mammalian 
species. This includes (but is not limited to) prosimians, such as lemurs and lorises; tarsioids, such as 
tarsiers; new-world monkeys, such as squirrel monkeys (Saimiri sciureus) and tamarins; old-world 

35 monkeys such as macaques (including Macaca nemestrina, Macaca fascicularis, and Macaca fuscata); 
hylobatids, such as gibbons and siamangs; pongids, such as orangutans, gorillas, and chimpanzees; 
and hominids, including humans. 

The "pathology" caused by a herpes virus infection is anything that compromises the well-being or 
normal physiology of the host. This may involve (but is not limited to) destructive invasion of the virus 

40 into previously uninfected cells, replication of the virus at the expense of the normal metabolism of the 
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cell, generation of toxins or other unnatural molecules by the virus, irregular growth of cells or 
intercellular structures (including fibrosis), irregular or suppressed biological activity of infected cells, 
malignant transformation, interference with the normal function of neighboring cells, aggravation or 
suppression of an inflammatory or immunological response, and increased susceptibility to other 
5 pathogenic organisms and conditions. 

"Treatment" of an individual or a cell is any type of intervention in an attempt to alter the natural 
course of the individual or cell. For example, treatment of an individual may be undertaken to decrease 
or limit the pathology caused by a herpes virus infecting the individual. Treatment includes (but is not 
limited to) administration of a composition, such as a pharmaceutical composition, and may be 

10 performed either prophylactically, or therapeutically, subsequent to the initiation of a pathologic event or 
contact with an etiologic agent. 

It is understood that a clinical or biological "sample" encompasses a variety of sample types 
obtained from a subject and useful in an in vitro procedure, such as a diagnostic test. The definition 
encompasses solid tissue samples obtained as a surgical removal, a pathology specimen, or a biopsy 

15 specimen, tissue cultures or cells derived therefrom and the progeny thereof, and sections or smears 
prepared from any of these sources. Non-limiting examples are samples obtained from infected sites, 
fibrotic sites, unaffected sites, and tumors. The definition also encompasses blood, spinal fluid, and 
other liquid samples of biologic origin, and may refer to either the cells or cell fragments suspended 
therein, or to the liquid medium and its solutes. The definition also includes samples that have been 

20 solubilized or enriched for certain components, such as DNA, RNA, protein, or antibody. 

Oligonucleotide primers and probes described herein have been named as follows: The first part 
of the designation is the single amino acid code for a portion of the conserved region of the DNA 
polymerase they are based upon, usually 4 residues long. This is followed with the letter A or B, 
indicating respectively that the oligonucleotide is complementary to the sense or anti-sense strand of 

25 the DNA polymerase encoding region. Secondary consensus oligonucleotides used for sequencing 
have the letters SQ at the end of the designation. 

General techniques 

30 The practice of the present invention will employ, unless otherwise indicated, conventional techniques 

of molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. See, for example, "Molecular Cloning: A Laboratory 
Manual", Second Edition (Sambrook, Fritsch & Maniatis, 1989), "Oligonucleotide Synthesis" (M.J. Gait, ed., 
1984), "Animal Ceil Culture" (R.I. Freshney, ed., 1987); the series "Methods in Enzymology" (Academic 

35 Press, Inc.); "Handbook of Experimental Immunology" (D.M. Weir & C.C. Blackwell, eds.), "Gene Transfer 
Vectors for Mammalian Cells" (J.M. Miller & MP. Calos, eds., 1987), "Current Protocols in Molecular 
Biology" (F.M. Ausubel et al., eds., 1987); and "Current Protocols in Immunology" (J.E. Coligan et al. t eds., 
1991). 

All patents, patent applications, articles and publications mentioned herein, both supra and infra, are 
40 hereby incorporated herein by reference. 



-23- 



WO 97/04105 



PCT/US96/11688 



Polynucleotides encoding DNA polymerase of the herpes virus RFHV/KSHV subfamily 

This invention embodies isolated polynucleotide segments derived from DNA polymerase genes 
5 present in herpes viruses, preferably that encode a fragment of a polypeptide that is capable of a DNA 
polymerase reaction. Polynucleotides provided are from the RFHV/KSHV subfamily of herpes viruses. 
Preferred polynucleotides are those encoding a fragment of a DNA polymerase from either RFHV or 
KSHV. Preferred fragments are those that have been amplified and isolated from the DNA polymerase 
gene, as described in the Examples below. Exemplary fragments are shown in Figure 1 , and designated 

10 SEQ. ID NO:1 and SEQ. ID NO:3, respectively. Especially preferred are polynucleotides comprising the 
sequence between residues 27 and 501 of the RFHV sequence (SEQ. ID NO:1), and the sequence 
between residues 27 and 329 of the KSHV sequence (SEQ. ID NO:3). 

The polynucleotide segments of RFHV and KSHV between residues 27 and 501 (the 475 base pair 
fragment underlined in Figure 1) are 71% identical. Shared residues are indicated in Figure 1 by u *\ The 

1 5 largest number of consecutive bases shared between RFHV and KSHV within this segment is 17. 

The 475 base pair fragments of RFHV and KSHV are more identical to each other than either of them 
are to the corresponding segment of any of the previously sequenced herpes viruses of Table 1 . The next 
most closely related sequence is the DNA polymerase from the eHV2 virus, which is about 68% identical 
in this region to either RFHV or KSHV. Contained within this region is a first 20 base pair subfragment 

20 (SEQ. ID NO: 110) and a second 20 base pair subfragment (SEQ. ID NO:111) which is shared identically 
between eHV2 and RFHV. The 475 base pair region is less than about 65% identical between either 
RFHV or KSHV and the other known DNA polymerase sequences from herpes viruses capable of infecting 
primates, including sHV1 and EBV. The longest subfragment shared identically between RFHV or KSHV, 
and sHV1, is about 14 bases in length. The longest subfragment shared identically between RFHV or 

25 KSHV, and EBV, is about 15 bases in length. It is predicted that polynucleotide sequences are more likely 
to be conserved between herpes virus DNA polymerase encoding regions than with other polynucleotides. 
Therefore, other than the two subfragments shared with eHV2, it is believed that any subfragment of the 
RFHV or KSHV sequence of 18 base pairs or longer will be unique to the RFHV/KSHV subfamily, or to 
particular herpes virus species and variants within the subfamily. 

30 This invention embodies subfragments contained in the DNA polymerase gene of the RFHV/KSHV 

subfamily, preferably contained in the region corresponding to the 475 base pair fragment between 
residues 27-501, as shown in Figure 1. Preferably, the sub-fragments are at least about 16 residues in 
length; more preferably they are at least 18 residues in length; more preferably they are at least 20 
nucleotides in length; more preferably they are at least about 25 nucleotides in length; more preferably 

35 they are at least about 35 nucleotides in length; still more preferably they are at least about 50 nucleotides 
in length; yet more preferably they are at least about 75 nucleotides in length, and even more preferably 
they are 100 nucleotides in length or more. Also embodied in this invention are polynucleotides 
comprising the entire open reading frame of each respective herpes virus DNA polymerase. 

To predict the role encoded peptide fragments play in the biological function of the DNA polymerase, 

40 comparisons may be made with other DNA polymerases. Conserved regions in the amino acid sequence 
of DNA polymerase from various herpes viruses are shown in Figure 2. The areas labeled Exol, Exoll, 
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and Exolll have been shown to be important binding sites for metal ligands at the 3-5' exonuclease active 
site (Derbyshire et al., Bernard et al. (1989), Simon et aL, Soengas et al.). The area designated as 
REGION 1 has been shown to be important in polymerization activity, and functions both as a drug binding 
site and polymerization substrate (deoxyribonucleotide triphosphate) binding site (Dorsky et aL (1988, 
5 1990), Bernard et al. (1990)). A mutation of the amino acid G to A in this region of herpes simplex (HSV) 1 
DNA polymerase inhibits polymerase activity in virus-infected cells. A mutation of F to C, Y or M yield 
different sensitivities to drugs such as nucleoside and pyrophosphate analogs, and aphidicolin. REGION 2 
and REGION 3 of the HSV1 DNA polymerase appear to be involved in drug and substrate recognition 
(Gibbs et al. (1988a, 1988b), Basco et al. (1993)). REGION 3 is involved in binding to the DNA template 

10 (Basco et al. (1992)). REGION 7 may be important in polymerization activity (Basco et al. (1993)). In 
some herpes viruses such as HSV1, amino acids near the C-terminal are involved in binding to a 
regulatory subunit known as UL42, encoded elsewhere in the herpes genome, and essential for DNA 
polymerase activity associated with replication of the virus (Dignard et al., Stow). 

The RFHV and KSHV polynucleotides shown in Figure 1 are near regions of the polynucleotide that 

15 encode functionally important parts of the DNA polymerase. Specifically, the oligonucleotides DFASA, 
VYGA, and GDTD1B map respectively to REGION 2, REGION 3, and REGION 1. The fragment between 
DFASA and GDTD1 B obtained for KSHV encompasses the entire REGION 4 and REGION 3 sequences, 
and overlaps with the REGION 2 and REGION 1 sequences. 

The RFHV/KSHV subfamily consists of members that have sequences that are more closely identical 

20 to the corresponding sequences of RFHV or KSHV, than RFHV or KSHV are to any other virus listed in 
Table 1 . Preferred members of the family may be identified on the basis of the sequence of the DNA 
polymerase gene in the region corresponding to that of Figure 1 . Table 2 provides the degree of sequence 
identities in this region: 

25 
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TABLE 2: Sequence Identities Between DNA Polymerase of Select Herpes Viruses and RFHV and 
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The percentage of sequence identity is calculated by first aligning the encoded amino acid 
sequence, determining the corresponding alignment of the encoding polynucleotide, and then counting 
5 the number of residues shared between the sequences being compared at each aligned position. No 
penalty is imposed for the presence of insertions or deletions, but insertions or deletions are permitted 
only where required to accommodate an obviously increased number of amino acid residues in one of 
the sequences being aligned. Offsetting insertions just to improve sequence alignment are not 
permitted at either the polypeptide or polynucleotide level. Thus, any insertions in the polynucleotide 

10 sequence will have a length which is a multiple of 3. The percentage is given in terms of residues in the 
test sequence that are identical to residues in the comparison or reference sequence. 

The degree of identity between viruses in Table 2 has been calculated for segments of the RFHV 
and KSHV sequence numbered as shown in Figure 1 . 

Preferred DNA polymerase-encoding polynucleotide sequences of this invention are those derived 

15 from the RFHV/KSHV herpes virus subfamily. They include those sequences that are at least 69% 
identical with the RFHV or KSHV sequence between bases 27 and 501 as shown in Figure 1; more 
preferably, the sequences are at least 70% identical; more preferably, the sequences are at least about 
72% identical; more preferably, the sequences are at least about 75% identical; more preferably, the 
sequences are at least about 80% identical; more preferably, the sequences are at least about 85% 

20 identical; more preferably, the sequences are at least about 90% identical; even more preferably, the 
sequences are over 95% identical. Also preferred are sequences that are at least 69% identical to the 
RFHV sequence between bases 27 and 329; more preferably, they are 70% identical; more preferably, 
they are at least 72% identical; more preferably, they are at least 75% identical; more preferably, they 
are at least 80% identical; more preferably, the sequences are at least 90% identical; even more 
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preferably, the sequences are at least 95% identical. Also preferred are sequences that are at least 
72% identical to the KSHV sequence between bases 27 and 329; more preferably, they are at least 
75% identical; more preferably, they are at least 80% identical; more preferably, they are at least 90% 
identical; even more preferably, they are 95% identical or more. 

Other preferred DNA polymerase-encoding polynucleotide sequences may be identified by the 
percent identity with RFHV/KSHV subfamily-specific oligonucleotides, described in more detail in a 
further section. The percent identity of RFHV and KSHV DNA polymerase with example 
oligonucleotides is shown in Table 3: 
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The percent identity shown in Table 3 was calculated for the corresponding residues of the viral 
sequences, aligned as shown in Figure 6. 

Preferred DNA polymerase sequences are those which over the corresponding region are at least 
about 80% identical to LSGGA; more preferably they are at least about 83% identical; more preferably 

15 they are at least about 86% identical; more preferably they are at least about 90% identical; even more 
preferably, they are at least 95% identical. Other preferred DNA polymerase sequences are those 
which over the corresponding region are at least about 69% identical to CTDPA; more preferably they 
are at least about 72% identical; more preferably they are at least about 75% identical; more preferably 
they are at least about 80% identical; more preferably they are at least about 85% identical; even more 

20 preferably, they are at least about 95% identical. Other preferred DNA polymerase sequences are 
those which over the corresponding region are at least about 95% identical to PCLNA. Other preferred 
DNA polymerase sequences are those which over the corresponding region are at least about 80% 
identical to KMLEA; more preferably they are at least about 83% identical; more preferably they are at 
least about 86% identical; more preferably they are at least about 90% identical; even more preferably, 

25 they are at least 95% identical or more. Other preferred DNA polymerase sequences are those which 
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over the corresponding region are at least about 69% identical to GISPA; more preferably they are at 
least about 72% identical; more preferably they are at least about 75% identical; more preferably they 
are at least about 80% identical; more preferably they are at least about 85% identical; even more 
preferably, they are at least about 95% identical. 
5 DNA polymerase encoding sequences from members of the RFHV/KSHV subfamily identified by 

any of the aforementioned sequence comparisons, using either RFHV or KSHV sequences, or the 
subfamily-specific oligonucleotides, are equally preferred. Especially preferred are DNA polymerase 
encoding sequences of RFHV and KSHV. Also embodied in this invention are fragments of DNA 
polymerase encoding sequences of the subfamily, and longer polynucleotides comprising such 
1 0 polynucleotide fragments. 

The polynucleotide sequences described in this section provide a basis for obtaining the synthetic 
oligonucleotides, proteins and antibodies outlined in the sections that follow. These compounds may 
be prepared by standard techniques known to a practitioner of ordinary skill in the art, and may be used 
for a number of investigative, diagnostic, and therapeutic purposes, as described below. 

15 

Preparation of polynucleotides 

Polynucleotides and oligonucleotides of this invention may be prepared by any suitable method 
known in the art. For example, oligonucleotide primers can be used in a PCR amplification of DNA 

20 obtained from herpes virus infected tissue, as in Example 3 and Example 5, described below. 
Alternatively, oligonucleotides can be used to identify suitable bacterial clones of a DNA library, as 
described below in Example 10. 

Polynucleotides may also be prepared directly from the sequence provided herein by chemical 
synthesis. Several methods of synthesis are known in the art, including the triester method and the 

25 phosphite method. In a preferred method, polynucleotides are prepared by solid-phase synthesis using 
mononucleoside phosphoramidite coupling units. See, for example Horise et al., Beaucage et al., Kumar 
et al., and U.S. Patent No. 4,415,732. 

A typical solid-phase synthesis involves reiterating four steps: deprotection, coupling, capping, and 
oxidation. This results in the stepwise synthesis of an oligonucleotide in the 3' to 5' direction. 

30 In the first step, the growing oligonucleotide, which is attached at the 3-end via a (-0-) group to a 

solid support, is deprotected at the 5' end. For example, the 5' end may be protected by a -ODMT group, 
formed by reacting with 4,4-dimethoxytrityl chloride (DMT-CI) in pyridine. This group is stable under basic 
conditions, but is easily removed under acid conditions, for example, in the presence of dichloroacetic acid 
(DCA) or trichloroacetic acid (TCA). Deprotection provides a 5' -OH reactive group. 

35 In the second step, the oligonucleotide is reacted with the desired nucleotide monomer, which itself 

has first been converted to a 5-protected, 3'-phosphoramidite. The 5' -OH of the monomer may be 
protected, for example, in the form of a -ODMT group, and the 3-OH group may be converted to a 
phosphoramidite, such as -OP(OR')NR 2 ; where R is the isopropyl group -CH(CH 3 ) 2 ; and R' is, for example, 
-H (yielding a phosphoramidite diester), or -CH 3 , -CH 2 CH 3 , or the beta-cyanoethyl group -CH 2 CH 2 CN 

40 (yielding a phosphoramidite triester). The 3'-phosphoramidite group of the monomer reacts with the 5* -OH 
group of the growing oligonucleotide to yield the phosphite linkage 5 -OP(OR')0-3\ 
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(yielding a phosphoramidite triester). The 3'-phosphoramidite group of the monomer reacts with the 5' -OH 
group of the growing oligonucleotide to yield the phosphite linkage 5'-OP(OR')0-3\ 

In the third step, oligonucleotides that have not coupled with the monomer are withdrawn from further 
synthesis to prevent the formation of incomplete polymers. This is achieved by capping the remaining 5' - 
5 OH groups, for example, in the form of acetates (-OC(0)CH 3l ) by reaction with acetic anhydride (CH 3 C(0)- 
0-C(0)CH 3 ). 

In the fourth step, the newly formed phosphite group (i.e., 5'-0P(0R')0-3') is oxidized to a phosphate 
group (i.e., 5 , -OP(=0)(OR , )0-3 f ); for example, by reaction with aqueous iodine and pyridine. 

The four-step process may then be reiterated, since the oligonucleotide obtained at the end of the 
10 process is 5*-protected and is ready for use in step one. When the desired full-length oligonucleotide has 
been obtained, it may be cleaved from the solid support, for example, by treatment with alkali and heat. 
This step may also serve to convert phosphate triesters (i.e., when R* is not -H) to the phosphate diesters 
(-OP(=0) 2 0-), and to deprotect base-labile protected amino groups of the nucleotide bases. 

Polynucleotides prepared by any of these methods can be replicated to provide a larger supply by 
1 5 any standard technique, such as PCR amplification or gene cloning. 

Cloning and expression vectors comprising a DNA polymerase encoding polynucleotide 

Cloning vectors and expression vectors are provided in this invention that comprise a sequence 
20 encoding a herpes virus DNA polymerase or variant or fragment thereof. Suitable cloning vectors may be 
constructed according to standard techniques, or may be selected from the large number of cloning 
vectors available in the art. While the cloning vector selected may vary according to the host cell intended 
to be used, useful cloning vectors will generally have the ability to self-replicate, may possess a single 
target for a particular restriction endonuclease, and may carry genes for a marker that can be used in 
25 selecting transfected clones. Suitable examples include plasmids and bacterial viruses; e.g., pUC18, 
mp18, mp19, pBR322, pMB9, CoIE1, pCR1, RP4, phage DNAs, and shuttle vectors like pSA3 and pAT28. 

Expression vectors generally are replicable polynucleotide constructs that encode a polypeptide 
operatively linked to suitable transcriptional and translational controlling elements. Examples of 

30 transcriptional controlling elements are promoters, enhancers, transcription initiation sites, and 
transcription termination sites. Examples of translational controlling elements are ribosome binding sites, 
translation initiation sites, and stop codons. Protein processing elements may also be included: for 
example, regions that encode leader or signal peptides and protease cleavage sites required for 
translocation of the polypeptide across the membrane or secretion from the cell. The elements employed 

35 would be functional in the host cell used for expression. The controlling elements may be derived from the 
same DNA polymerase gene used in the vector, or they may be heterologous (i.e., derived from other 
genes and/or other organisms). 

Polynucleotides may be inserted into host cells by any means known in the art. Suitable host cells 
include bacterial cells such as E. coli, mycobacteria, other procaryotic microorganisms and eukaryotic cells 

40 (including fungal cells, insect cells, plant cells, and animal cells). The cells are transformed by inserting the 
exogenous polynucleotide by direct uptake, endocytosis, transfection, f-mating, or electroporation. 
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Subsequently, the exogenous polynucleotide may be maintained within the cell as a non-integrated vector, 
such as a plasmid, or may alternatively be integrated into the host cell genome. 

Cloning vectors may be used to obtain replicate copies of the polynucleotides they contain, or as a 
means of storing the polynucleotides in a depository for future recovery. Expression vectors and host cells 
5 may be used to obtain polypeptides transcribed by the polynucleotides they contain. They may also be 
used in assays where it is desirable to have intact cells capable of synthesizing the polypeptide, such as in 
a drug screening assay. 

Synthetic oligonucleotides for herpes virus DNA polymerase useful as hybridization probes 
1 0 and amplification primers 

Oligonucleotides designed from sequences of herpes virus DNA polymerase, as embodied in this 
invention, can be used as probes to identify related sequences, or as primers in an amplification reaction 
such as a PCR. 

15 Different oligonucleotides with different properties are described in the sections that follow. 

Oligonucleotides designated as Type 1 are designed to hybridize with polynucleotides encoding any 
herpes virus DNA polymerase, and may be used to detect previously known species of herpes virus. They 
may also be used to detect and characterize new species of herpes virus. Oligonucleotides designated as 
Type 2 are designed to hybridize with DNA polymerase encoding polynucleotides of the RFHV/KSHV 

20 subfamily, including members not yet identified, but not with polynucleotides of other herpes viruses. 
Oligonucleotides designated as Type 3 are designed to hybridize specifically with polynucleotides 
encoding DNA polymerase only from RFHV, or alternatively from KSHV. 

Preferred examples of Type 1 oligonucleotides are listed in Table 4. These oligonucleotides have a 
specificity for DNA polymerase encoding polynucleotides of a broad range of herpes viruses. 
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TABLE 4: Type 1 Oligonucleotides used for Detecting, Amplifying, or Characterizing Herpes Virus 
. ;•• ' I Polynucleotides encoding DNA Polymerase 



: |liiln|rtjQn||||l 


lillllllM^ 




•l||id||||;| 
||forms|i§ 




||6riert|ii 


llsiil 

ID: 




I Cj 1 1 OCaAC 1 1 YGCNAGYYTNT 
AYCC 


26 


256 


Herpes DNA 
polymerase 


5->3 


mm 

5 


DFCiSA 


GTGTTCGACTTYCARAGYYTNT 
AYCC 


26 


128 


Herpes DNA 
polymerase, 
especially the 
beta subfamily 


5'->3' 


6 


VYGA 


ACGTGCAACGCGGTGTAYGGN 
KTNACNGG 


29 


256 


Herpes DNA 
polymerase 


5'->3' 


7 


; : :i:|;:^Q^:S;;|: 


ACGTGCAACGCGGTGTACGGS 


29 


8 


Herpes DNA 
polymerase 

(GC-rich) 


5'-»3' 


8 




ACGTGCAACGCGGTGTA 


17 


1 


Herpes DNA 
polymerase 


5'->3* 


9 


l : tG^D|||i| 


CGGCATGCGACAAACACGGAG 
TCNGTRTCNCCRTA 


35 


64 


Herpes DNA 
polymerase 


3'->5' 


11 




CGGCATGCGACAAACACGGA 


20 


1 


Herpes DNA 


3*->5' 


12 










polymerase 



















The orientation indicated is relative to the encoding region of the polynucleotide. Oligomers with a 
5'->3' orientation will hybridize to the strand antisense to the coding strand and initiate amplification in 
the direction of the coding sequence. Oligomers with a 3'->5' orientation will hybridize to the coding 
5 strand and initiate amplification in the direction opposite to the coding sequence. 

These oligonucleotides have been designed with several properties in mind: 1) sensitivity for 
target DNA even when present in the source material at very low copy numbers; 2) sufficient specificity 
to avoid hybridizing with unwanted sequences; for example, endogenous DNA polymerase sequences 
present in the host; 3) sufficient cross-reactivity so that differences between an unknown target and the 
10 sequence used to design it do not prevent the oligonucleotide from forming a stable duplex with the 
target. 

For some applications, a particularly effective design is oligonucleotides that have a degenerate 
segment at the 3' end, designed from a region of at least 2 known polynucleotides believed to be 
somewhat conserved with the polynucleotide target. The various permutations of the ambiguous 

1 5 residues help ensure that at least one of the alternative forms of the oligonucleotide will be able to 
hybridize with the target. Adjacent to the degenerate segment at the 5' end of the oligonucleotide is a 
consensus segment which strengthens any duplex which may form and permits hybridization or 
amplification reactions to be done at higher temperatures. The degenerate segment is located at the 3' 
end of the molecule to increase the likelihood of a close match between the oligonucleotide and the 

20 target at the site where elongation begins during a polymerase chain reaction. 

The ambiguous residues in the degenerate part of the sequences are indicated according to the 
following code: 
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TABLE 5: Single Letter Codes for Ambiguous Positions 






R 


AorG (purine) 


Y 


CorT (pyrimidine) 


W 


AorT 


s 


CorG 


M 


AorC 


K 


GorT 


B 


CorGorT (not A) 


D 


AorGorT (notC) 


H 


A or CorT (notG) 


V 


A or CorG (notT) 


N 


A or C or G or T 



The Type 1 oligonucleotides shown in Table 4 are generally useful for hybridizing with DNA 
polymerase encoding polynucleotide segments. This may be conducted to detect the presence of the 
5 polynucleotide, or to prime an amplification reaction so that the polynucleotide can be characterized 
further. Suitable targets include polynucleotides encoding a region of a DNA polymerase from a wide 
spectrum of herpes viruses, including those in the alpha, beta, and gamma herpes viruses, those 
infecting any vertebrate animal, including humans and non-human primates, whether or not the 
polymerase or the virus has been previously known or described. Non-limiting examples include 
10 polynucleotides encoding DNA polymerase from any of the herpes viruses listed in Table 1. We have 
used these oligonucleotides to obtain segments of the DNA polymerase from RFHV, KSHV, EBV, 
HSV1, HHV6 and HHV7 - a group that includes representatives from the alpha, beta, and gamma 
subfamilies. 

The oligonucleotides may be used, inter alia, to prime a reaction to amplify a region of the target 
15 polynucleotide in the 3' direction from the site where the oligonucleotide hybridizes. DFASA, DFQSA, 
VYGA, VYGCA and GDTD1B are oligonucleotides with a consensus segment adjoining a degenerate 
segment, and are useful for that purpose, and also may be used when the sequence of the target DNA 
is unknown. Selection between oligonucleotides DFASA and DFQSA depends on the sequence of the 
target polynucleotide. DFQSA promotes amplification of HHV6-like sequences somewhat better than 
20 other sequences; DFASA promotes amplification of both HHV6- and non-HHV6-like sequences. VYGA 
has a broad cross-reactivity and is especially useful as a primer for a second amplification reaction 
preformed using polynucleotides first amplified by another primer, such as DFASA. VYGCA is a GC- 
rich analog of VYGA, producing less complex amplification mixtures and allowing hybridization 
reactions to occur at higher temperatures. VYGSQA and GDTDSQB are specific non-degenerate 
25 oligonucleotides which can be used, inter alia, to sequence amplification products made with VYGA or 
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GDTD1B, respectively; or for more specific amplification of a target polynucleotide after a preliminary 
amplification with a degenerate primer. 

A preferred source of DNA for use as a target for the oligonucleotides of Table 4 is any biological 
sample (including solid tissue and tissue cultures), particularly of vertebrate animal origin, known or 
5 suspected to harbor a herpes virus. DNA is extracted from the source by any method known in the art, 
including extraction with organic solvents or precipitation at high salt concentration. 

A preferred method of amplification is a polymerase chain reaction: see generally U.S. Patent No. 
4,683,195 (Mullis) and U.S. 4,683,202 (Mullis et al.); see U.S. 5,176,995 (Sninsky et al.) for application 
to viral polynucleotides. An amplification reaction may be conducted by combining the target 

10 polynucleotide to be amplified with short oligonucleotides capable of hybridizing with the target and 
acting as a primer for the polymerization reaction. Also added are substrate mononucleotides and a 
heat-stable DNA-dependent DNA polymerase, such as Taq. The conditions used for amplification 
reactions are generally known in the art, and can be optimized empirically using sources of known 
viruses, such RFHV, KSHV, EBV or HSV1. Conditions can be altered, for example, by changing the 

15 time and temperature of the amplification cycle, particularly the hybridization phase; changing the 
molarity of the oligonucleotide primers; changing the buffer concentration; and changing the number of 
amplification cycles. Fine-tuning the amplification conditions is a routine matter for a practitioner of 
ordinary skill in the art. 

In one method, a single primer of this invention is used in the amplification, optionally using a 

20 second primer, such as a random primer, to initiate replication downstream from the first primer and in 
the opposite direction. In a preferred method, at least two of the primers of this invention are used in 
the same reaction to initiate replication in opposite directions. The use of at least two specific primers 
enhances the specificity of the amplification reaction, and defines the size of the fragment for 
comparison between samples. For example, amplification may be performed using primers DFASA 

25 and GDTD1 B. More preferred is the use of all three primers in a nested fashion to enhance the 
amplification. Nesting is accomplished by performing a first amplification using primers that encompass 
an intermediate fragment comprising a binding site for a third primer. This is followed by a second 
amplification using the third primer, thereby providing a final fragment that is a subfragment of the 
intermediate fragment. Particularly preferred is a first amplification using primer DFASA and primer 

30 GDTD1B, followed by a second amplification using primer VYGA and primer GDTD1B. When 
performed on a polynucleotide from a DNA polymerase gene of RFHV or KSHV, the size of the 
fragment is about 236 bases. 

The amplified polynucleotides can be characterized at any stage during the amplification reaction, 
for example, by size determination. Preferably, this is performed by running the polynucleotide on a gel 

35 of about 1-2% agarose. If present in sufficient quantity, the polynucleotide in the gel can be stained 
with ethidium bromide and detected under ultraviolet light. Alternatively, the polynucleotide can be 
labeled with a radioisotope such as 32 P or 35 S before loading on a gel of about 6% polyacrylamide, and 
the gel can subsequently be used to produce an autoradiogram. A preferred method of labeling the 
amplified polynucleotide is to end-label an oligonucleotide primer such as VYGA or VYGSQA with 32 P 
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using a polynucleotide kinase and gamma-[ 32 P]-ATP, and continuing amplification for about 5-15 
cycles. 

If desired, size separation may also be used as a step in the preparation of the amplified 
polynucleotide. This is particularly useful when the amplification mixture is found to contain artifact 
polynucleotides of different size, such as may have arisen through cross-reactivity with undesired 
targets. A separating gel, such as described in the preceding paragraph, is dried onto a paper backing 
and used to produce an autoradiogram. Positions of the gel corresponding to the desired bands on the 
autoradiogram are cut out and extracted by standard techniques. The extracted polynucleotide can 
then be characterized directly, cloned, or used for a further round of amplification. 

Unwanted polynucleotides in the mixture from an amplification reaction can also be proportionally 
reduced by shifting to more specific oligonucleotide primers. For example, an initial 3-5 cycles of 
amplification can be conducted using primers VYGA and GDTD1B at 1/5 to 1/25 the normal amount. 
Then a molar excess (for example, 50 pmol) of GDTDSQB and/or VYGSQA are added, and the 
amplification is continued for an additional 30-35 cycles. This reduces the complexity of the 
oligonucleotides present in the amplification mixture, and enables the reaction temperatures to be 
increased to reduce amplification of unwanted polynucleotides. 

Preferred examples of Type 2 oligonucleotides are listed in Table 6: 



TABLE 6: Type 2 Oligonucleotides Specific for Polynucleotides Encoding DNA Polymerase from 

Viruses 


.:>;-S:'Desig|l|| 










l^nenr!;;: 




• :LSGt3A 


TACGAAACCTTTGACCTNAGY 
GGNGG 

CGCAAGAACCTGGCCTCNTG 
YACNGAYCC 


26 


32 


DNA polymerase 
of the 
RFHV/KSHV 
subfamily 


5'->3' 


107 




29 


64 


5'->3' 


108 




GTCGCCTCTGGCATCCTNCC 
NTGYCTNAA 


29 


128 




5'-»3' 


21 




CAGGGCCGGAAGATGCTGG 
ARACRTCNCARGC 


32 


32 




5'->3* 


22 


^3)SPA 


TCTCAGGCGTTCGTAGARGG 
NATHTCNCC 


29 


96 




5'->3* 


109 



LSGGA, CTDPA, PCLNA, KM LEA and GISPA are all oligonucleotides with a consensus segment 
at the 5' end joined directly to a degenerate segment at the 3' end. They are capable of forming stable 
duplexes with a polynucleotide encoding DNA polymerase from either RFHV, KSHV, or from other 
viruses of the RFHV/KSHV subfamily. They can be used for any purpose in which such specificity is 
desired, such as the detection or amplification of polynucleotides from the RFHV/KSHV subfamily. 

In one application, these Type 2 oligonucleotides are used individually or in combination as 
amplification primers. In one example of this application, the oligonucleotides are used directly on DNA 
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obtained from a tissue sample to obtain a DNA polymerase segment derived from RFHV, KSHV, or 
closely related viruses, but not more distantly related viruses such as EBV, CMV or HSV. In another 
example, the DNA from a tissue sample is first amplified with a less specific set of probes, such as 
DFASA or VYGA, in combination with GDTD1B, One of the oligonucleotides of Table 6 is then used in 
5 a second round of amplification, thereby providing a sensitive nested amplification assay which is 
specific for RFHV, KSHV, and other members of the RFHV/KSHV subfamily. 

In another application, Type 2 oligonucleotides, or oligonucleotides comprising these sequences or 
fragments thereof, are used as probes in a detection assay. For example, they can be provided with a 
suitable label such as 32 P, and then used in a hybridization assay with a suitable target, such as DNA 
1 0 amplified using DFASA and/or VYGA, along with GDTD1 B. 

Preferred examples of Type 3 oligonucleotides are shown in Table 7: 



TABLE 7: Typ6 3 Oligonucleotides Specific for Polynucleotides Encoding DNA Polymerase from 


■. V;;-ibesig--:i : ;p; 
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■ Sequence 
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Orien- 
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SEQ 




CGTCGCTTCCGGCATCCTAC 
C 


21 


1 


RFHV DNA 
polymerase 


5'->3' 


13 


iupca 


GGCATCCTACCGTGCCTGAA 
C 


21 


1 




5'-»3' 


14 




CCGGAGACGCCTCGATCGGT 
C 


21 


1 




3'->5* 


15 




AACCTGGCTTCCGGAGACGC 
C 


21 


1 




3'-»5' 


16 


sgila; ; 


GCGTTGCCTCTGGCATACTG 


20 


1 


KSHV DNA 
polymerase 


5'->3' 


17 














CLiNIA 


CTGCCTTGCCTAAACATAGC 
G 


21 


1 




5'->3' 


18 


;.c:ieasb;;|| 


GGTGAGACGTCTATTGGCCT 


20 


1 




3'->5' 


19 




AATCGGGCGTCGGGTGAGAC 
G 


21 


1 




3'->5' 


20 



These are non-degenerate oligonucleotides designed to be specific for DNA polymerase encoding 
1 5 polynucleotides of particular herpes viruses; namely RFHV or KSHV. The particular sequence chosen 
is from a segment of the encoding region that is more different from that of the other virus than 
neighboring segments. 

VASGA, ILPCA, PIEAB, and PEARB are specific non-degenerate oligonucleotides for the RFHV 
DNA polymerase, and can be used in hybridization reactions conducted at high stringency. For 
20 example, they can be used alone or in combination as primers for amplifying a target polynucleotide 
encoding RFHV DNA polymerase. Preferably, the amplification is done using the oligonucleotides in a 
nested fashion: e.g., a first amplification is conducted using VASGA and PEARB as primers; then a 
second amplification is conducted using ILPCA and PIEAB as primers. 
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Similarly, SGILA, CLNIA, IEASB, and EARFB are specific non-degenerate oligonucleotides for the 
KSHV DNA polymerase, and can be used in a similar fashion, including as primers for an amplification 
reaction. Preferably, the amplification is done using the oligonucleotides in a nested fashion: e.g., a 
first amplification is conducted using SGILA and EARFB as primers; then a second amplification is 
5 conducted using CLNIA and IEASB as primers. This provides an extremely sensitive amplification 
assay that is specific for KSHV DNA polymerase. 

Practitioners skilled in the art will immediately recognize that oligonucleotides of Types 1 , 2, and 3 
(in particular, those shown in Tables 4, 6, and 7) can be used in combination with each other in a PCR 
to amplify different sections of a DNA polymerase encoding polynucleotide. The specificity of the 
10 amplification reaction generally is determined by the primer with the least amount of cross reactivity. 
The size and location of the amplified fragment is determined by the primers used in the final round of 
amplification. For example, LSSGA used in combination with GDTD1B will amplify about 361 bases of 
DNA polymerase encoding polynucleotide from a virus of the RFHV/KSHV subfamily. Similarly, VYGA 
used in combination with PEARB will amplify about 444 bases of DNA polymerase encoding 
15 polynucleotide from RFHV. Suitable combinations of oligonucleotides may be used as amplification 
primers in a nested fashion. 

Use of synthetic oligonucleotides to characterize polynucleotide targets 

20 As described in the previous section, the oligonucleotides embodied in this invention, can be used as 

primers for amplification of polynucleotides encoding a herpes virus DNA polymerase, particularly in a 
polymerase chain reaction. 

The conditions for conducting the PCR depend on the nature of the oligonucleotide being used. In 
particular, when using oligonucleotides comprising a degenerate segment, or a consensus segment that is 

25 only partly identical to the corresponding segment of the target, and when the target polynucleotide 
comprises an unknown sequence, the selection of conditions may be important to the success of the 
amplification. Optimizing conditions for a new primer or new polynucleotide target are routine for a 
practitioner of ordinary skill. What follows is a guide to assist in that objective. 

First, the temperature of the annealing step of the PCR is optimized to increase the amount of target 

30 polynucleotide being amplified above the amount of unrelated polynucleotide amplified. Ideally, the 
temperature permits the primers to hybridize with the target sequence but not with other sequences. For 
primers comprising a consensus segment, the temperature of the annealing step is generally at least about 
55°C; preferably it is at least about 60°C. Primers which are virus-specific are more selective, and may be 
effective over a broader temperature range; between 50°C and 65°C. 

35 Second, the buffer conditions are optimized. We have found that buffers supplied with commercial 

preparations of Taq polymerase are sometimes difficult to use, in part because of a critical dependence on 
the concentration of magnesium ion. PCRs performed using the oligonucleotides of this invention 
generally are more easily performed using a buffer such as that suggested by M. Wigler (Lisitsyn et al.). 
Preferably, the final PCR reaction mixture contains (NH 4 ) 2 S0 4 instead of KCI as the principal ion source. 

40 Preferably, the concentration of (NH 4 ) 2 S0 4 in the final reaction mixture is about 5-50 mM, more preferably 
about 10-30 mM, even more preferably 16 mM. The buffering component is preferably Tris, preferably at a 
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final concentration of about 67 mM and a pH of about 8,8. Under these conditions, the MgCI 2 
concentration is less critical. Preferably the final concentration is about 1-10 mM, more preferably it is 
about 3-6 mM, optimally it is about 4 mM. The reaction mixture may also contain about 10 mM fc- 
mercaptoethanol and 0.05-1 mg/mL bovine serum albumin. An especially preferred buffer is WB4 buffer 
5 (67 mM Tris buffer pH 8.8, 4 mM MgCI 2 , 16 mM (NH 4 ) 2 S0 4j 10 mM G-mercaptoethanol and 0.1 mg/mL 
albumin. Preferred conditions for performing the reaction are provided below in Example 3. 

Amplification reactions using any the oligonucleotides of this invention as primers yield polynucleotide 
fragments encoding a portion of a DNA polymerase. These fragments can be characterized by a number 
of techniques known to a practitioner of ordinary skill in the art. Some non-limiting methods for 
1 0 characterizing a fragment are as follows: 

In one method, a fragment may be sequenced according to any method of sequence determination 
known in the art, including the Maxam & Gilbert method, or the Sanger & Nicholson method. Alternatively, 
the fragment may be submitted to any of the commercial organizations that provide a polynucleotide 
sequencing service. The fragment may optionally be cloned and/or amplified before sequencing. The 
15 nucleotide sequence can be used to predict the amino acid sequence encoded by the fragment. 
Sequence data can be used for comparison with other sequenced DNA polymerases, either at the 
polynucleotide level or the amino acid level, to identify the species of herpes virus present in the original 
source material. Sequence data can also be used in modeling algorithms to predict antigenic regions or 
three-dimensional structure. 

20 In a second method of characterizing, the size of the fragment can be determined by any suitable 

method, such as running on a polyacrylamide or agarose gel, or centrifuging through an appropriate 
density gradient. For example, for RFHV and KSHV, the fragment between VYGA and GDTD1 B is about 
172 bases. Hence, the length of the entire amplified fragment including primer binding regions is about 
236 bases. The corresponding EBV fragment contains an additional 9 base pairs. The EBV fragment can 

25 therefore be distinguished from that of RFHV or KSHV, for example, by running amplified polynucleotide 
fragments from each in neighboring lanes of a separating gel, or by running the EBV fragment beside 
suitable molecular weight standards. Polynucleotide fragments identical in size to that of RFHV and KSHV 
may be derived from a variant strain of one of these viruses, or a closely related species. Fragments 
substantially different in size are more likely to be derived from a different herpes virus. 

30 In a third method of characterizing, a fragment can be tested by attempting to hybridize it with an 

oligonucleotide probe. In a preferred example, a fragment is tested for relatedness to the DNA polymerase 
encoding region of RFHV or KSHV. The test is conducted using a probe comprising a sequence of a DNA 
polymerase encoding region, or its genetic complement. Suitable probes are polynucleotides comprising 
sequences from RFHV or KSHV, a mixture of such polynucleotides, or a polynucleotide comprising a 

35 degenerate sequence derived from RFHV and KSHV, such as the oligonucleotides listed in Table 6. 

The length and nature of the probe and the hybridization conditions are selected depending on the 
objectives of the test. If the objective is to detect only polynucleotides from RFHV or KSHV, including 
minor variants, then hybridization is performed under conditions of high stringency. A sequence from the 
respective RFHV or KSHV DNA polymerase is used. Longer length sequences improve the specificity of 

40 the test and can be used under conditions of higher stringency. Preferably, the probe will comprise a DNA 
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polymerase sequence of at least about 30 nucleotides; more preferably, the sequence will be at least 
about 50 nucleotides; even more preferably, the sequence will be at least about 75 nucleotides in length. 

If the objective is to detect polynucleotides that are related to RFHV or KSHV, such as in a screening 
test or a test to recruit previously undescribed viruses of the RFHV/KSHV subfamily, then different 
5 conditions are chosen. Sequences from RFHV or KSHV may be used, but a mixture of the two or a 
degenerate probe is generally preferred. The length of the sequence and the conditions of the 
hybridization reaction are selected to provide sufficient specificity to exclude unwanted sequences, but 
otherwise provide a maximum degree of cross-reactivity amongst potential targets. Suitable conditions 
can be predicted using the formulas given earlier, by calculating the T m and then calculating the 

10 corresponding temperature for the maximum degree of mismatch to be tolerated. The suitability of the 
conditions can be tested empirically by testing the cross-reactivity of the probes with samples containing 
known target polynucleotides encoding herpes DNA polymerases. 

The minimum degree of complementarity required for a stable duplex to form under the conditions of 
the assay will determine what DNA polymerase sequences will hybridize with the probe. Consider, for 

15 example, a target obtained from a human or non-human primate, amplified to produce a fragment 
corresponding to bases 330-501 of Figure 1 , and then probed with the corresponding fragment of the 
RFHV polynucleotide. According to the data in Table 2, if the hybridization reaction is performed under 
conditions that require only about 50% identity for a stable duplex to form, the probe may hybridize with 
targets from any of the sequenced gamma herpes DNA polymerase genes, including EBV and sHV1 . If 

20 the reaction is performed under conditions that require at least about 62% identity between probe and 
target, preferably at least about 65% identity, more preferably at least about 68% identity, and even more 
preferably at least about 70% identity for a stable duplex to form, the assay will detect a target 
polynucleotide from RFHV, KSHV, or from a related herpes virus DNA polymerase that has not yet been 
sequenced. Polynucleotides encoding DNA polymerase from EBV or sHV1 are not expected to form a 

25 stable duplex under these conditions. A polynucleotide encoding DNA polymerase from eHV2 is not 
expected to be present in the DNA tested, because eHV2 is not believed to be capable of infecting 
primates. 

It is possible to combine characterization by size and characterization by hybridization. For example, 
the amplified polynucleotide may be separated on a gel of acrylamide or agarose, blotted to a membrane 

30 of suitable material, such as nitrocellulose, and then hybridized with a probe with a suitable label, such as 
32 P. The presence of the label after washing reflects the presence of hybridizable material in the sample, 
while the migration distance compared with appropriate molecular weight standards reflects the size of the 
material. A fragment sequence hybridizing with one of the aforementioned probes under conditions of high 
stringency but having an unexpected size would indicate a DNA polymerase sequence with a high degree 

35 of identity to the probe, but distinct from RFHV or KSHV. 

Use of polynucleotides and oligonucleotides to detect herpes virus infection 

Polynucleotides encoding herpes virus DNA polymerase, and synthetic oligonucleotides based 
40 thereupon, as embodied in this invention, are useful in the diagnosis of clinical conditions associated with 
herpes virus infection. For example, the presence of detectable herpes DNA polymerase in a clinical 
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sample may suggest that the respective herpes virus participated as an etioiogic agent in the development 
of the condition. The presence of polymerase in a particular tissue, but not in surrounding tissue, may be 
useful in the localization of an infected lesion. Differentiating between gamma herpes virus and other 
herpes viruses in clinical samples, or differentiating between RFHV, KSHV, and EBV, may be useful in 
5 predicting the clinical course of an infection or selecting a drug suitable for treatment. 

In addition, since DNA polymerase is actively involved in the replication of the herpes virus, it may be 
preferred over other markers for certain applications. DNA polymerase is not expressed in the latent state 
of Varicella-Zoster herpes, but is expressed in the replicative state (Meier et aL). Thus, an assay for DNA 
polymerase may help determine whether an individual infected with gamma herpes is currently in an active 

1 0 phase of the disease. The capacity of a strain of HSV1 to move from the eye to the brain is related to DNA 
polymerase activity (Yeung et al.). Thus, an assay for DNA polymerase may help predict the 
aggressiveness or invasiveness of a gamma herpes infection. 

The procedures for conducting diagnostic tests are extensively known in the art, and are routine for a 
practitioner of ordinary skill. Generally, to perform a diagnostic method of this invention, one of the 

1 5 compositions of this invention is provided as a reagent to detect a target in a clinical sample with which it 
reacts. For example, a polynucleotide of this invention may be used as a reagent to detect a DNA or RNA 
target, such as might be present in a cell infected with a herpes virus. A polypeptide of this invention may 
be used as a reagent to detect a target with which it is capable of forming a specific complex, such as an 
antibody molecule or (if the polypeptide is a receptor) the corresponding ligand. An antibody of this 

20 invention may be used as a reagent to detect a target it specifically recognizes, such as a polypeptide 
expressed by virally infected cells. 

The target is supplied by obtaining a suitable tissue sample from an individual for whom the 
diagnostic parameter is to be measured. Relevant test samples are those obtained from individuals 
suspected of harboring a herpes virus. Many types of samples are suitable for this purpose, including 

25 those that are obtained near the suspected site of infection or pathology by biopsy or surgical dissection, in 
vitro cultures of cells derived therefrom, solubilized extracts, blood, and blood components. If desired, the 
target may be partially purified from the sample or amplified before the assay is conducted. The reaction is 
performed by contacting the reagent with the sample under conditions that will allow a complex to form 
between the reagent and the target. The reaction may be performed in solution, or on a solid tissue 

30 sample, for example, using histology sections. The formation of the complex is detected by a number of 
techniques known in the art. For example, the reagent may be supplied with a label and unreacted 
reagent may be removed from the complex; the amount of remaining label thereby indicating the amount 
of complex formed. Further details and alternatives for complex detection are provided in the descriptions 
that follow. 

35 To determine whether the amount of complex formed is representative of herpes infected or 

uninfected cells, the assay result is preferably compared with a similar assay conducted on a control 
sample. It is generally preferable to use a control sample which is from an uninfected source, and 
otherwise similar in composition to the clinical sample being tested. However, any control sample may be 
suitable provided the relative amount of target in the control is known or can be used for comparative 

40 purposes. It is often preferable to conduct the assay on the test sample and the control sample 
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simultaneously. However, if the amount of complex formed is quantifiable and sufficiently consistent, it is 
acceptable to assay the test sample and control sample on different days or in different laboratories. 

Accordingly, polynucleotides encoding DNA polymerase of the RFHV/KSHV subfamily, and the 
synthetic oligonucleotides embodied in this invention, can be used to detect gamma herpes virus 
5 polynucleotide that may be present in a biological sample. General methods for using polynucleotides in 
specific diagnostic assays are well known in the art: see, e.g., Patent Application JP 5309000 (latron). 

An assay employing a polynucleotide reagent may be rendered specific, for example: 1) by 
performing a hybridization reaction with a specific probe; 2) by performing an amplification with a specific 
primer, or 3) by a combination of the two. 

10 To perform an assay that is specific due to hybridization with a specific probe, a polynucleotide is 

chosen with the required degree of complementarity for the intended target. Preferred probes include 
polynucleotides of at least about 1 6 nucleotides in length encoding a portion of the DNA polymerase of 
RFHV, KSHV, or a member of the RFHV/KSHV subfamily. Increasingly preferred are probes comprising 
at least about 18, 20, 25, 30, 50, or 100 nucleotides of the DNA polymerase encoding region. Also 

15 preferred are degenerate probes capable of forming stable duplexes with polynucleotides of the 
RFHV/KSHV subfamily, but not with that of other herpes viruses. 

The probe is generally provided with a label. Some of the labels often used in this type of assay 
include radioisotopes such as 32 P and ^P, chemiluminescent or fluorescent reagents such as fluorescein, 
and enzymes such as alkaline phosphatase that are capable of producing a colored solute or precipitant. 

20 The label may be intrinsic to the reagent, it may be attached by direct chemical linkage, or it may be 
connected through a series of intermediate reactive molecules, such as a biotin-avidin complex, or a series 
of inter-reactive polynucleotides. The label may be added to the reagent before hybridization with the 
target polynucleotide, or afterwards. To improve the sensitivity of the assay, it is often desirable to 
increase the signal ensuing from hybridization. This can be accomplished by using a combination of 

25 serially hybridizing polynucleotides or branched polynucleotides in such a way that multiple label 
components become incorporated into each complex. See U.S. Patent No. 5,124,246 (Urdea et al.). 

If desired, the target polynucleotide may be extracted from the sample, and may also be partially 
purified. To measure viral particles, the preparation is preferably enriched for DNA; to measure active 
transcription of DNA polymerase, the preparation is preferably enriched for RNA. Generally, it is 

30 anticipated that the level of polynucleotide of a herpes virus will be low in clinical samples: there may be 
just a few copies of DNA encoding the polymerase per cell where the virus is latent, or up to several 
hundred copies of DNA per cell where the virus is replicating. The level of mRNA will be higher in cells 
where the polymerase is actively expressed than those where the polymerase gene is inactive. It may 
therefore be desirable to enhance the level of target in the sample by amplifying the DNA or RNA. A 

35 suitable method of amplification is a PCR, which is preferably conducted using one or more of the 
oligonucleotide primers embodied in this invention. RNA may be amplified by making a cDNA copy using 
a reverse transcriptase, and then conducting a PCR using the aforementioned primers. 

The target polynucleotide can be optionally subjected to any combination of additional treatments, 
including digestion with restriction endonucleases, size separation, for example by electrophoresis in 

40 agarose or polyacrylamide, and affixation to a reaction matrix, such as a blotting material. 
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Hybridization is allowed to occur by mixing the reagent polynucleotide with a sample suspected of 
containing a target polynucleotide under appropriate reaction conditions. This may be followed by washing 
or separation to remove unreacted reagent. Generally, both the target polynucleotide and the reagent 
must be at least partly equilibrated into the single-stranded form in order for complementary sequences to 
hybridize efficiently. Thus, it may be useful (particularly in tests for DNA) to prepare the sample by 
standard denaturation techniques known in the art. 

The level of stringency chosen for the hybridization conditions depends on the objective of the test. If 
it is desired that the test be specific for RFHV or KSHV, then a probe comprising a segment of the 
respective DNA polymerase is used, and the reaction is conducted under conditions of high stringency. 
For example, a preferred set of conditions for use with a preferred probe of 50 nucleotides or more is 6 x 
SSC at 37°C in 50% formamide, followed by a wash at low ionic strength. This will generally require the 
target to be at least about 90% identical with the polynucleotide probe for a stable duplex to form. The 
specificity of the reaction for RFHV or KSHV can also be increased by increasing the length of the probe 
used. Thus, longer probes are particularly preferred for this application of the invention. 

Alternatively, if it is desired that the test be able to detect gamma herpes viruses related to RFHV or 
KSHV, then a lower stringency is used. Suitable probes include fragments from the RFHV or KSHV DNA 
polymerase, a mixture thereof, or degenerate oligonucleotides such as those listed in Table 6. 

Appropriate hybridization conditions are determined to permit hybridization of the probe only to DNA 
polymerase sequences that have the desired degree of identity with the probe. The stringency required 
depends on the length of the polynucleotide probe, and the degree of identity between the probe and the 
desired target sequence. Consider, for example, a probe consisting of the KSHV polynucleotide fragment 
between the hybridization sites of DFASA and GDTD1B. Conditions requiring a minimum identity of 55% 
would result in a stable duplex formed with a corresponding polynucleotide of KSHV, RFHV, and EBV; 
conditions requiring a minimum identity of 68% would result in a stable duplex forming with a 
polynucleotide from KSHV, RFHV, or a related polynucleotide, but not EBV; conditions requiring a 
minimum identity of 80% would result in a stable duplex forming with a polynucleotide from KSHV, but not 
RFHV or EBV (see Table 2). 

Conditions can be estimated beforehand using the formula given earlier. Preferably, the exact 
conditions are confirmed by testing the probe with separate samples known to contain polynucleotides, 
both those desired to be detected and those desired to go undetected in the assay. Such samples may be 
provided either by synthesizing the polynucleotides from published sequences, or by extracting and 
amplifying DNA from tissues believed to be infected with the respective herpes virus. Determining 
hybridization conditions is a matter of routine adjustment for a practitioner of ordinary skill, and does not 
require undue experimentation. Since eHV2, sHV1 and EBV are more closely identical to RFHV or KSHV 
than members of the alpha and beta subfamilies, conditions that exclude polynucleotides of those viruses 
will generally also exclude the other herpes viruses listed in Table 1 . In addition, if it is believed that 
certain viruses will not be present in the sample to be tested in the ultimate determination (such as eHV2 in 
a human tissue sample), then the corresponding target sequences may optionally be omitted when 
working out the conditions of the assay. Thus, conditions can be determined that would permit an 
oligonucleotide probe such as LSGGA, CTDPA, KM LEA or GISPA to form a stable duplex both with 
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polypeptides comprising SEQ. ID NO:1 and SEQ. ID NO:3, but not a sequence selected from the group 
consisting of SEQ. ID NO:23 to SEQ. ID NO:29. Conditions can also be determined that would permit an 
oligonucleotide probe such as PCLNA (SEQ. ID NO:21) or any suitable fragment comprising at least 18 or 
more consecutive bases of SEQ. ID NO:1 or SEQ. ID NO:3 to form a stable duplex both with a 
5 polynucleotide comprising SEQ. ID NO:1 and with a polynucleotide comprising SEQ. ID NO:3, but not a 
polynucleotide comprising one of SEQ. ID NO:23 to SEQ. ID NQ:29. 

Alternatively, to conduct an assay that is specific due to amplification with a specific primer. DNA or 
RNA is prepared from the biological sample as before. Optionally, the target polynucleotide is 
pre-ampiified in a PCR using primers which are not species specific, such as those listed in Table 4. The 

1 0 target is then amplified using specific primers, such as those listed in Table 6 or Table 7. For example, if it 
is desired that the test be specific for RFHV, then VASGA, ILPCA, PIEAB, PEARB, or a combination 
thereof may be used. If it is desired that the test be specific for KSHV, then SGILA, CLNIA, IEASB, 
EARFB, or a combination thereof may be used. If it is desired that the test be able to detect gamma 
herpes viruses related to RFHV or KSHV, then degenerate or cross-reactive probes, such as those listed 

15 in Table 6, or a combination thereof may be used. In a preferred embodiment, two rounds of amplification 
are performed, using oligonucleotide primers in a nested fashion: virus-specific or non-specific in the first 
round; virus-specific in the second round. This provides an assay which is both sensitive and specific. 

Use of a specific primer during amplification is sufficient to provide the required specificity. A positive 
test may be indicated by the presence of sufficient reaction product at the end of the amplification series. 

20 Amplified polynucleotide can be detected, for example, by blotting the reaction mixture onto a medium 
such as nitrocellulose and staining with ethidium bromide. Alternatively, a radiolabeled substrate may be 
added to the mixture during a final amplification cycle; the incorporated label may be separated from 
unincorporated label (e.g., by blotting or by size separation), and the label may be detected (e.g. by 
counting or by autoradiography). If run on a gel of agarose or polyacrylamide, the size of the product may 

25 help confirm the identity of the amplified fragment. Specific amplification can also be followed by specific 
hybridization, by using the amplification mixture obtained from the foregoing procedure as a target source 
for the hybridization reaction outlined earlier. 

Use of polynucleotides for gene therapy 

30 

Embodied in this invention are pharmaceuticals comprising virus-specific polynucleotides, 
polypeptides, or antibodies as an active ingredient. Such compositions may decrease the pathology of the 
virus or infected cells on their own, or render the virus or infected cells more susceptible to treatment by 
non-specific pharmaceutical compounds. 

35 Polynucleotides of this invention encoding part of a herpes virus DNA polymerase may be used, for 

example, for administration to an infected individual for purposes of gene therapy (see generally U.S. 
Patent No. 5,399,346: Anderson et al.). The general principle is to administer the polynucleotide in such a 
way that it interferes with the expression of the corresponding gene, such as by complexing with the gene 
itself or with the RNA transcribed from the gene. Entry of the polynucleotide into the cell is facilitated by 

40 suitable techniques known in the art, such as providing the polynucleotide in the form of a suitable vector, 
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or encapsulation of the polynucleotide in a liposome. The polynucleotide may be injected systemically, or 
provided to the site of infection by an antigen-specific homing mechanism, or by direct injection. 

A preferred mode of gene therapy is to provide the polynucleotide in such a way that it will replicate 
inside the cell, enhancing and prolonging the interference effect. Thus, the polynucleotide is operatively 
5 linked to a suitable promoter, such as the natural promoter of the corresponding gene, a heterologous 
promoter that is intrinsically active in infected cells, or a heterologous promoter that can be induced by a 
suitable agent. Preferably, the construct is designed so that the polynucleotide sequence operatively 
linked to the promoter is complementary to the sequence of the corresponding gene. Thus, once 
integrated into the cellular genome, the transcript of the administered polynucleotide will be 
1 0 complementary to the transcript of the gene, and capable of hybridizing with it. This approach is known as 
anti-sense therapy. 

RFHV/KSHV subfamily polypeptides with DNA polymerase activity and fragments thereof 

15 The RFHV and KSHV polynucleotides shown in Figure 1 each have an open reading frame. The 

polypeptides encoded are respectively designated SEQ. ID NO:2 and SEQ. ID NO:4. The polypeptides 
have a significant number of homologous residues to DNA polymerases of other sequenced herpes 
viruses. They are more closely identical to each other within this fragment than to the corresponding 
fragment of the other sequenced viruses. The fragment is believed to encompass residues that are near 

20 the nucleotide substrate binding site of the intact protein. This region may play a role in the catalytic 
activity of the polymerase. Polypeptides with DNA polymerase activity from other members of the 
RFHV/KSHV subfamily are expected to share a large proportion of identical residues over this region. In 
general, residues conserved between RFHV and KSHV are expected to be relatively conserved within the 
subfamily. 

25 Beginning at about amino acid 89 of SEQ. ID NO:2, there is a linear sequence of about 46 residues 

that is shared identically between the DNA polymerase of RFHV and KSHV. Beginning at about amino 
acid 88 of SEQ. ID NO:2, there is a linear sequence of about 31 residues shared between the DNA 
polymerase of RFHV and eHV2. The sequence shared with eHV2 is listed separately in SEQ. ID NO:1 12. 
Also contained in SEQ. ID NO:112 is a sequence of about 26 amino acids shared between RFHV and 

30 sHV1 , and two sequences of 12 amino acids shared between RFHV and EBV. Beginning at about amino 
acid 10 of SEQ. ID NO:4, there is a linear sequence of about 15 residues shared between KSHV and 
various other gamma herpes viruses. This shared sequence is listed separately in SEQ. ID NO:1 13. The 
longest sequence contained in SEQ. ID NOS:2 or 4 but not in SEQ. ID NOS:112 or 113 that is shared with 
other known herpes virus DNA polymerases is 10 amino acids in length. Hence, any fragment of the 

35 RFHV or KSHV DNA polymerase protein sequence that is 1 1 amino acids or longer, and not in SEQ. ID 
NOS: 1 1 2 or 1 1 3, is believed to be specific for the RFHV/KSHV subfamily, or species and variants therein. 

This invention embodies both intact DNA polymerase from herpes viruses of the RFHV/KSHV 
subfamily, and any fragment thereof that is specific for the subfamily. Preferred DNA polymerase 
fragments of this invention are at least 1 1 amino acids in length; more preferably they are about 12 amino 

40 acids in length, more preferably they are at least about 15 amino acids in length; even more preferably 
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they are at least about 20 amino acids in length, still more preferably they are at least about 30 amino 
acids in length. 

The amino acid sequence of the RFHV and KSHV DNA polymerase fragments can be used to 
identify virus-specific and cross-reactive antigenic regions. 

In principle, a specific antibody could recognize any amino acid difference between sequences that is 
not also shared by the species from which the antibody is derived. Antibody binding sites are generally big 
enough to encompass 5-9 amino acid residues of an antigen, and are quite capable of recognizing a single 
amino acid difference. Specific antibodies may be part of a polyclonal response arising spontaneously jn 
animals infected with a virus expressing the DNA polymerase. Specific antibodies may also be induced by 
injecting an experimental animal with either the intact polymerase or a polymerase fragment. 

Thus, any peptide of 5 amino acids or more that is unique to RFHV or KSHV is a potential 
virus-specific antigen, and could be recognized by a RFHV- or KSHV-specific antibody. Peptides of at 
least 5 amino acids shared between RFHV and KSHV, but not EBV, eHV2 and sHV1 are potential 
RFHV/KSHV subfamily specific antigens. 

Some examples of preferred peptides are shown in Table 8. Practitioners in the art will immediately 
recognize that other peptides with similar specificities may be designed by minor alterations to the length 
of the peptides listed and/or moving the frame of the peptide a few residues in either direction. 

The Class I peptides shown in Table 8 are conserved between all known DNA polymerase 
polypeptide sequences of the gamma herpes virus subfamily. An antibody directed against one such DNA 
polymerase in this region is expected to cross-react with the others. Class II peptides are conserved 
between RFHV and KSHV, but not with sHV1 and EBV. An antibody directed against this region is 
expected to cross-react between RFHV, KSHV, and other viruses of the RFHV/KSHV subfamily. Class III 
peptides are different between RFHV and KSHV. An antibody binding to this region, particularly to 
non-identical residues, is expected to distinguish the RFHV DNA polymerase from the KSHV DNA 
polymerase. 
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Peptides contained within 
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QRPIEAS 


(RFHV) 


86 


RRPIDVS 


(KSHV) 


87 


IEASPEA 


(RFHV) 


88 


IDVSPDA 


(KSHV) 


89 



Specificity 



1 - Not shared with eHV2, sHV1 or EBV, except where indicated 

2 - Also shared with eHV2 but not with sHV1 or EBV 

3 - Not shared with any other sequenced herpes virus; may be 

present in some unsequenced RFHV/KSHV subfamily viruses 



Particularly preferred peptides from Classes II and Ml are QGRKMLE, ARFKVI, RRPIDVS, 
QRPIEAS, IEASPEA, and IDVSPDA. Given the complete sequence of a DNA polymerase from RFHV 
and/or KSHV, virus- or subfamily-specific peptides can be predicted for other regions of the molecule by 
a similar analysis. 



Preparation of polypeptides 

Polypeptides of this invention, including intact protein, protein fragments, and antigenic regions, can 
10 be prepared by several different methods, all of which will be known to a practitioner of ordinary skill. For 
example, the appropriate strand of the full-length cDNA can be operatively linked to a suitable promoter, 
and transfected into a suitable host cell. The host cell is then cultured under conditions that allow 
transcription and translation to occur, and the polypeptide is subsequently recovered. For a description of 
the expression and recovery of a herpes virus DNA polymerase by transfecting S. cenevisiae, see Haffey 
15 et al. and Patent Application EP 0337441. For a description of the expression of another herpes virus 
protein in mammalian cells, see U.S. Patent No. 5,244,792 (Burke et al.). 

Polypeptides may also be prepared directly from sequence data by chemical synthesis. Several 
methods of synthesis are known in the art. A preferred method is the solid-phase Merrifield technique. 
Alternatively, a polynucleotide encoding the desired polypeptide may be prepared by any of the methods 
20 described earlier, and translated using an in vitro translation system, such as the rabbit reticulocyte 
system. See, e.g., Dorsky et ai. 



Use of polypeptides to assess herpes virus infection 

25 The polypeptides embodied in this invention may be used to detect or assess the status of a herpes 

virus infection in an individual in several different applications. 

In one application, a polypeptide encoding a portion of a herpes virus DNA polymerase is supplied as 
a reagent for an assay to detect the presence of antibodies that can specifically recognize it. Such 
antibodies may be present, for example, in the circulation of an individual with current or past herpes virus 

30 infection. 
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The presence of antibodies to DNA polymerase in the circulation may provide an early indication of a 
pathological condition. The antibody to hepatitis B virus DNA polymerase is an early indication of acute 
hepatitis B virus infection (WO 8904964: Fietelson et al.). Antibodies to DNA polymerase are useful in 
diagnosis of nasopharyngeal carcinoma (Lin et al., Liu et al.). Similarly, it may be useful to monitor for the 
5 presence of antibodies to DNA polymerase of KSHV in HIV-infected humans before Kaposi's sarcoma 
lesions are clinically apparent. 

Suitable clinical samples in which to measure antibody levels include serum or plasma from an 
individual suspected of having a gamma herpes virus infection. The presence of the antibody is 
determined, for example, by an immunoassay. 

10 A number of immunoassay methods are established in the art for performing the quantitation of 

antibody using viral peptides (see, e.g., U.S. Patent No. 5,350,671 : Houghton et al.). For example, the test 
sample potentially containing the specific antibody may be mixed with a pre-determined non-limiting 
amount of the reagent polypeptide. The reagent may contain a directly attached label, such as an enzyme 
or a radioisotope. For a liquid-phase assay, unreacted reagents are removed by a separation technique, 

1 5 such as filtration or chromatography. Alternatively, the antibody in the sample may be first captured by a 
reagent on a solid phase. This may be, for example, the specific polypeptide, an anti-immunoglobulin, or 
protein A. The captured antibody is then detected with a second reagent, such as the specific polypeptide, 
anti-immunoglobulin, or protein A with an attached label. At least one of the capture reagent or the 
detecting reagent must be the specific polypeptide. In a third variation, cells or tissue sections containing 

20 the polypeptide may be overlaid first with the test sample containing the antibody, and then with a 
detecting reagent such as labeled anti-immunoglobulin. In all these examples, the amount of label 
captured in the complex is positively related to the amount of specific antibody present in the test sample. 
Similar assays can be designed in which antibody in the test sample competes with labeled antibody for 
binding to a limiting amount of the specific peptide. The amount of label in the complex is then negatively 

25 correlated with the amount of specific antibody in the test sample. Results obtained using any of these 
assays are compared between test samples, and control samples from an uninfected source. 

By selecting the reagent polypeptide appropriately, antibodies of a desired specificity may be 
detected. For example, if the intact DNA polymerase is used, or a fragment comprising regions that are 
conserved between herpes virus, then antibodies detected in the test samples may be virus specific, 

30 cross-reactive, or both. A reagent of this nature is preferred for a general screening assay for herpes virus 
infection. To render the assay specific for antibodies directed either against RFHV or against KSHV, 
antigen peptides comprising non-conserved regions of the appropriate viral DNA polymerase are selected, 
such as those listed in Class III of Table 8. Preferably, a mixture of such peptides is used. To 
simultaneously detect antibodies against RFHV, KSHV, and closely related viruses of the gamma herpes 

35 family, but not sHV1 and EBV, antigen peptides are selected with the properties of those listed in Class II 
of Table 8. Preferably, a mixture of such peptides is used. 

Antibodies stimulated during a herpes virus infection may subside once the infection resolves, or they 
may persist as part of the immunological memory of the host. In the latter instance, antibodies due to 
current infection may be distinguished from antibodies due to immunological memory by determining the 

40 class of the antibody. For example, an assay may be conducted in which antibody in the test sample is 
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captured with the specific polypeptide, and then developed with labeled anti-IgM or anti-IgG. The 
presence of specific antibody in the test sample of the IgM class indicates ongoing infection, while the 
presence of IgG antibodies alone indicates that the activity is due to immunological memory of a previous 
infection or vaccination. 

5 In another application of the invention, herpes virus encoded DNA polymerase is isolated from a 

sample, and the amount present is determined by an enzymatic assay. Assays for DNA polymerase 
activity in a biological sample can be conducted, for example, by extracting the polymerase and performing 
a suitable polymerization assay. The polymerase may be solubilized by standard techniques from a solid 
tissue sample or tissue homogenate, for example, by using non-ionic detergents such as TRITON™ X-100 

10 or deoxycholate. Alternatively, if the polymerase is secreted by infected cells, it may be possible to 
perform the assay on a liquid sample, such as plasma or lymph. 

Methods for conducting DNA polymerase assays are known in the art. For example, a 
polymerization mixture is prepared that contains the putative DNA polymerase, a mixture of nucleotides 
containing at least one labeled nucleotide, a DNA template such as M13 phage DNA, and, if necessary, a 

1 5 regulatory subunit. The mixture is incubated at 37°C for a time sufficient to allow polymerization to occur. 
Polymerase activity, or lack thereof, is determined by measuring the amount of incorporation of label into 
the polynucleotide. Optimal conditions for conducting a DNA polymerase assay are readily ascertained 
without undue experimentation by a practitioner of ordinary skill in the art. For example, conditions for the 
DNA polymerase of HSV have been published by O'Donnell et al. Optimal conditions for DNA polymerase 

20 from RFHV and KSHV are expected to be analogous. 

Use of polypeptides as components in active vaccines 

An example of how polypeptides embodied in this invention can be effectively used in treatment is 
25 through vaccination. 

In one embodiment of this application, the polypeptide is administered as part of an active vaccine in 
order to stimulate antibodies that will react against the pathogenic organism; in this case, a herpesvirus of 
the RFHV/KSHV subfamily. The development of active vaccines from isolated herpes virus components is 
known in the art: see, e.g., U.S. Patent Nos. 5,171,568 (Berke et al.). This type of vaccine is especially 

30 useful in prophylaxis, since the antibodies it stimulates may be able to neutralize subsequently 
encountered organisms before they have a chance to invade the host's cells and begin a replicative cycle. 

Methods for preparing and administering polypeptide vaccines are known in the art. Peptides may be 
capable of eliciting an immune response on their own, or they may be rendered more immunogenic by 
chemical manipulation, such as cross-linking or attaching to a protein carrier like KLH. Preferably, the 

35 vaccine also comprises an adjuvant, such as alum, muramyl dipeptides, liposomes, or DETOX™. The 
vaccine may optionally comprise auxiliary substances such as wetting agents, emulsifying agents, and 
organic or inorganic salts or acids. It also comprises a pharmaceutical^ acceptable excipient which is 
compatible with the active ingredient and appropriate for the route of administration. The desired dose for 
peptide vaccines is generally from 10 ng to 1 mg, with a broad effective latitude. The vaccine is preferably 

40 administered first as a priming dose, and then again as a boosting dose, usually at least four weeks later. 
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Further boosting doses may be given to enhance the effect. The dose and its timing are usually 
determined by the person responsible for the treatment. 

In another embodiment of this application, the polypeptide is an active ingredient of a vaccine 
designed to stimulate specific cytotoxic T lymphocytes. This type of vaccine may be especially useful in 
the treatment of a herpes virus infection already present in a subject. The DNA polymerase of a herpes 
virus is a suitable target for a cytotoxic T cell vaccine; not only because of its relatively conserved 
structure, but also because it is an important internal component of the virus. External virus components 
are expressed by circulating intact virus and defective viral particles, and have the potential of diverting the 
immune system away from infected cells. However, internal viral components are expressed 
extracellulariy only by virally infected cells, which display them in the context of histocompatibility class I 
molecules. Such cells are ideal targets for specific cytotoxic T cells, as they represent the site of viral 
replication and are therefore the virus's most vulnerable location within the host. 

Cytotoxic T cell vaccines may comprise different antigenic regions than those required to stimulate 
antibodies. T cell epitopes are different from antibody epitopes; they generally depend less on 
conformational context, and/or develop from regions of the peptide capable of folding into an amphipathic 
alpha-helix. Cytotoxic T cell vaccines may also comprise additional active ingredients or cytokines which 
may enhance the presentation of the peptide to the T cell population and/or assist in the recruitment of 
cells of the cytotoxic T cell lineage. 

In a variation of either of the preceding embodiments, the immunizing peptide is provided not as an 
isolated protein or protein fragment, but in the form of an expression vector. A polynucleotide encoding the 
peptide is operatively linked to suitable controlling elements of transcription and translation, and then 
transfected into a suitable vector. Suitable vectors include a vaccinia virus, or an attenuated form of a 
herpes virus. 

Use of polypeptides to design or screen anti-viral drugs 

Interfering with the DNA polymerase gene or gene product would modify the infection process, or the 
progress of this disease. It is an objective of this invention to provide a method by which useful 
pharmaceutical compositions and methods of employing such compounds in the treatment of gamma 
herpes virus infection can be developed and tested. Particularly preferred are pharmaceutical compounds 
useful in treating infections by RFHV and KSHV. Suitable drugs are those that interfere with transcription 
or translation of the DNA polymerase gene, and those that interfere with the catalytic function of the 
polypeptide encoded by the gene. It is not necessary that the mechanism of interference be known; only 
that the interference be preferential for reactions associated with the infectious process. 

Preferred drugs include those that competitively interfere with the binding of the DNA polymerase to 
the substrate nucleotide triphosphate, the DNA template. Also preferred are nucleotide analogs that can 
be incorporated into the polymerizing strand synthesized by the enzyme, but form a dead-end complex 
that prevents further polymerization (Reardon et al.) Some non-limiting examples of preferred drugs which 
may be tested by the procedures described herein are aphidicolon, acyclovir, gancyclovir, foscarnet, 
oosporein, BHCG, PMEA, other nucleotide analogs, isotrenes of these compounds, and other compounds 
that are structurally or functionally related to those listed. 
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Also preferred are drugs that interfere with the association of DNA polymerase with regulatory 
subunits that are necessary for catalytic activity. As described earlier, the UL42 subunit is essential for the 
DNA polymerase activity of HSV during the replicative process. Small peptides designed from the UL42 
sequence inhibit binding between UL42 and the DNA polymerase, and are effective inhibitors of 
5 polymerase activity (U.S. Patent No. 5,223,391: Coen et al.). The C-terminal region of the HSV DNA 
polymerase is responsible for binding the UL42 subunit. It is therefore expected that under certain 
conditions (such as those required for viral replication), the RFHV and KSHV DNA polymerase will require 
a regulatory subunit which may or may not be an analog of UL42, in order to express full polymerase 
activity. Thus, peptides functionally equivalent to those described in U.S. 5,223,391 , adapted appropriately 

1 0 for gamma herpes viruses, are expected to have inhibitory activity and be therapeutically useful. 

This invention provides methods for screening pharmaceutical candidates to determine which are 
suitable for clinical use. The methods may be brought to bear on antiviral compounds that are currently 
known, and those which may be designed in the future. 

The method involves combining an active DNA polymerase with the pharmaceutical candidate, and 

1 5 determining whether the biochemical function is altered by the pharmaceutical candidate. The DNA 
polymerase may be any fragment encoded by the DNA polymerase gene of RFHV or KSHV that has DNA 
polymerase activity. Suitable fragments may be obtained by expressing a genetically engineered 
polypeptide encoding the active sites of the molecule, or by cleaving the DNA polymerase with proteases 
and purifying the active fragments. In a preferred embodiment, the entire DNA polymerase is provided. 

20 The reaction mixture will also comprise a suitable DNA template, substrate deoxyribonucleotide 
triphosphates, and whatever regulatory subunits are necessary for the reaction to proceed. One 
embodiment of the screening method is to perform a DNA polymerase assay in vitro. The DNA 
polymerase is provided in isolated form, and mixed with the other reacting compounds in a suitable buffer. 
A DNA polymerase assay is conducted and monitored as outlined in an earlier section. The amount of 

25 polymerase activity per mole or per gram of enzyme in the reaction mixture is measured, for example, by 
the rate of incorporation of radiolabeled nucleotide into the synthesized strand. The effect of the candidate 
drug may be determined by running two reactions in parallel, both with the same mixture of reacting 
substances except that one contains the candidate. Alternatively, the effect of the candidate drug may be 
determined by adding it to a polymerase reaction in progress, and determining whether the reaction rate is 

30 altered. A desirable effect is one that eliminates or decreases the rate of synthesis of the labeled DNA. 

Another embodiment of the screening method is to express a polynucleotide encoding an active 
region of the DNA polymerase in a host cell. Transfection with the polynucleotide may enhance the rate of 
replication of the host cell, in which case the activity of the polymerase can be monitored by measuring the 
rate of replication of the cells. Alternatively, activity of the polymerase may be measured as the rate of 

35 production of a product, such as a labeled polynucleotide, inside the cell. The effect of the drug can 
therefore be determined by following its effect on DNA polymerase activity. Suitable control experiments 
include measuring DNA polymerase activity in the absence of the drug, and measuring the effect of the 
drug on untransformed host cells. 

A further embodiment of the screening method is to measure binding of the pharmaceutical candidate 

40 to the isolated DNA polymerase, or a fragment thereof. Compounds that bind to the catalytic site or the 
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binding site of a regulatory subunit are expected to interfere with DNA polymerase activity. Thus, the 
entire DNA polymerase, or a fragment comprising the catalytic site or the binding site of a regulatory 
subunit, is mixed with the pharmaceutical candidate. Binding of the candidate can be measured directly, 
for example, by providing the candidate in a radiolabeled or stable-isotope labeled form. The presence of 
5 label bound to the polymerase can be determined, for example, by precipitating the polymerase with a 
suitable antibody, or by providing the polymerase attached to a solid phase, and washing the solid phase 
after the reaction. Binding of the candidate to the polymerase may also be observed as a conformational 
change in the polymerase, detected for example by difference spectroscopy, nuclear magnetic resonance, 
or circular dichroism. Alternatively, binding may be determined in a competitive assay: for example, DNA 

1 0 polymerase is mixed with the candidate, and then labeled nucleotide or a fragment of a regulatory subunit 
is added later. Binding of the candidate to the biochemically relevant site should inhibit subsequent 
binding of the labeled compound. 

This invention also provides for the development of pharmaceuticals for the treatment of herpes 
infection by rational drug design. See, generally, Hodgson, and Erickson et al. In this embodiment, the 

15 three-dimensional structure of the DNA polymerase is determined, either by predictive modeling based on 
the amino acid sequence, or preferably, by experimental determination. Experimental methods include 
antibody mapping, mutational analysis, and the formation of anti-idiotypes. Especially preferred is X-ray 
crystallography. Knowing the three-dimensional structure of the protease, especially the orientation of 
important amino acid groups near the nucleotide and regulatory subunit binding sites, a compound is 

20 designed de novo, or an existing compound is suitably modified. The designed compound will have an 
appropriate charge balance, hydrophobicity, and/or shape to enable it to attach near an active site of the 
polymerase, and sterically interfere with the normal biochemical function of that site. Preferably, 
compounds designed by this method are subsequently tested in a drug screening assay, such as those 
outlined above. 

25 

Antibodies against DNA polymerase and their preparation 

The amino acid sequence of the herpes virus DNA polymerases embodied herein are foreign to the 
hosts they infect. The polymerases are large, and potentially comprise a large number of antigenic 

30 regions. They are sequestered within the capsid of the respective virus, and are unlikely to be mimicking 
host antigens. It is therefore expected that these polymerases will be substantially immunogenic. 
Antibodies may be generated against them spontaneously by a vertebrate host during the course of an 
infection with an intact herpes virus. It is also expected that antibodies can be raised in experimental 
animals by injection of isolated DNA polymerase and suitably prepared fragments. These expectations 

35 are supported by the observations described in Example 5 and Example 10. 

Antibodies against a polypeptide are generally prepared by any method known in the art. To 
stimulate antibody production in an animal experimentally, it is often preferable to enhance the 
immunogenicity of a polypeptide by such techniques as polymerization with glutaraldehyde, or combining 
with an adjuvant, such as Freund's adjuvant. The immunogen is injected into a suitable experimental 

40 animal: preferably a rodent for the preparation of monoclonal antibodies; preferably a larger animal such 
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as a rabbit or sheep for preparation of polyclonal antibodies. It is preferable to provide a second or booster 
injection after about 4 weeks, and begin harvesting the antibody source no less than about 1 week later. 

Sera harvested from the immunized animals provide a source of polyclonal antibodies. Detailed 
procedures for purifying specific antibody activity from a source material are known within the art. If 
5 desired, the specific antibody activity can be further purified by such techniques as protein A 
chromatography, ammonium sulfate precipitation, ion exchange chromatography, high-performance liquid 
chromatography and immunoaffinity chromatography on a column of the immunizing polypeptide coupled 
to a solid support. 

Polyclonal antibodies raised by immunizing with an intact DNA polymerase or a fragment comprising 

10 conserved sequences may be cross-reactive between herpes viruses. Antibodies that are virus or 
subfamily specific may be raised by immunizing with a suitably specific antigen, such as those listed above 
in Table 8. Alternatively, polyclonal antibodies raised against a larger fragment may be rendered specific 
by removing unwanted activity against other virus DNA polymerases, for example, by passing the 
antibodies over an adsorbant made from those polymerases and collecting the unbound fraction. 

15 Alternatively, immune cells such as splenocytes can be recovered from the immunized animals and 

used to prepare a monoclonal antibody-producing cell line. See, for example, Harrow & Lane (1988), U.S. 
Patent Nos. 4,472,500 (Milstein et al.), and U.S. 4,444,887 (Hoffman et al.) 

Briefly, an antibody-producing line can be produced inter alia by cell fusion, or by transforming 
antibody-producing cells with Epstein Barr Virus, or transforming with oncogenic DNA. The treated cells 

20 are cloned and cultured, and clones are selected that produce antibody of the desired specificity. 
Specificity testing can be performed on culture supematants by a number of techniques, such as using the 
immunizing polypeptide as the detecting reagent in a standard immunoassay, or using cells expressing the 
polypeptide in immunohistochemistry. A supply of monoclonal antibody from the selected clones can be 
purified from a large volume of tissue culture supernatant, or from the ascites fluid of suitably prepared 

25 host animals injected with the clone. 

Effective variations of this method include those in which the immunization with the polypeptide is 
performed on isolated cells. Antibody fragments and other derivatives can be prepared by methods of 
standard protein chemistry, such as subjecting the antibody to cleavage with a proteolytic enzyme. 
Genetically engineered variants of the antibody can be produced by obtaining a polynucleotide encoding 

30 the antibody, and applying the general methods of molecular biology to introduce mutations and translate 
the variant. 

Monoclonal antibodies raised by injecting an intact DNA polymerase or a fragment comprising 
conserved sequences may be cross-reactive between herpes viruses. Antibodies that are virus or 
subfamily specific may be raised by immunizing with a suitably specific antigen, as may be selected from 
35 Table 8. Alternatively, virus-specific clones may be selected from the cloned hybridomas by using a 
suitable antigen, such as one selected from Table 8, in the screening process. 

Use of antibodies for detecting DNA polymerase in biological samples 

40 Antibodies can be used to detect DNA polymerase polypeptides and fragments of viral origin that 

may be present, for example, in solid tissue samples and cultured cells. Immunohistological techniques to 
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carry out such determinations will be obvious to a practitioner of ordinary skill. Generally, the tissue is 
preserved by a combination of techniques which may include freezing, exchanging into different solvents, 
fixing with agents such as paraformaldehyde, drying with agents such as alcohol, or embedding in a 
commercially available medium such as paraffin or OCT. A section of the sample is suitably prepared and 
5 overlaid with a primary antibody specific for the protein. 

The primary antibody may be provided directly with a suitable label. More frequently, the primary 
antibody is detected using one of a number of developing reagents which are easily produced or available 
commercially. Typically, these developing reagents are anti-immunoglobulin or protein A, and they 
typically bear labels which include, but are not limited to: fluorescent markers such as fluorescein, 

10 enzymes such as peroxidase that are capable of precipitating a suitable chemical compound, electron 
dense markers such as colloidal gold, or radioisotopes such as 125 l. The section is then visualized using 
an appropriate microscopic technique, and the level of labeling is compared between the suspected virally 
infected and a control cell, such as cells surrounding the area of infection or taken from a remote site. 

Proteins encoded by a DNA polymerase gene can also be detected in a standard quantitative 

1 5 immunoassay. If the protein is secreted or shed from infected cell in any appreciable amount, it may be 
detectable in plasma or serum samples. Alternatively, the target protein may be solubilized or extracted 
from a solid tissue sample. Before quantitating, the protein may optionally be affixed to a solid phase, 
such as by a blot technique or using a capture antibody. 

A number of immunoassay methods are established in the art for performing the quantitation. For 

20 example, the protein may be mixed with a pre-determined non-limiting amount of the reagent antibody 
specific for the protein. The reagent antibody may contain a directly attached label, such as an enzyme or 
a radioisotope, or a second labeled reagent may be added, such as anti-immunoglobulin or protein A. For 
a solid-phase assay, unreacted reagents are removed by washing. For a liquid-phase assay, unreacted 
reagents are removed by some other separation technique, such as filtration or chromatography. The 

25 amount of label captured in the complex is positively related to the amount of target protein present in the 
test sample. A variation of this technique is a competitive assay, in which the target protein competes with 
a labeled analog for binding sites on the specific antibody. In this case, the amount of label captured is 
negatively related to the amount of target protein present in a test sample. Results obtained using any 
such assay are compared between test samples, and control samples from an uninfected source. 

30 Specific antibodies against herpes virus DNA polymerase have a number of uses in developmental, 

diagnostic and therapeutic work. For example, antibodies can be used in drug screening (see U.S. Patent 
No. 5,120,639), or to prepare a passive vaccine. They may also be used for detecting herpes virus in a 
biological sample and for drug targeting, as described in the following sections. 

35 Use of antibodies for drug targeting 

An example of how antibodies can be used in therapy of herpes virus infection is in the specific 
targeting of effector components. Virally infected cells generally display peptides of the virus (including 
internal viral components) on their cell surface in the context of histocompatibility class I antigens. The 
40 peptide therefore provides a marker for infected cells that a specific antibody can bind to. An effector 
component attached to the antibody therefore becomes concentrated near the infected cells, improving the 
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effect on those cells and decreasing the effect on uninfected cells. Furthermore, if the antibody is able to 
induce endocytosis, this will enhance entry of the effector into the cell interior. 

For the purpose of targeting, an antibody specific for the viral polypeptide (in this case, a region of a 
DNA polymerase) is conjugated with a suitable effector component, preferably by a covalent or high-affinity 
5 bond. Suitable effector components in such compositions include radionuclides such as 131 1, toxic 
chemicals, and toxic peptides such as diphtheria toxin. Another suitable effector component is an 
antisense polynucleotide, optionally encapsulated in a liposome. 

In most applications of antibody molecules in human therapy, it is preferable to use human 
monoclones, or antibodies that have been humanized by techniques known in the art. This helps prevent 
1 0 the antibody molecules themselves from becoming a target of the host's immune system. 

Diagnostic kits 

Diagnostic procedures using the polynucleotides, oligonucleotides, peptides, or antibodies of this 

1 5 invention may be performed by diagnostic laboratories, experimental laboratories, practitioners, or private 
individuals. This invention provides diagnostic kits which can be used in these settings. The presence of a 
herpes virus in the individual may be manifest in a clinical sample obtained from that individual as an 
alteration in the DNA, RNA, protein, or antibodies contained in the sample. An alteration in one of these 
components resulting from the presence of a herpes virus may take the form of an increase or decrease of 

20 the level of the component, or an alteration in the form of the component, compared with that in a sample 
from a healthy individual. The clinical sample is optionally pre-treated for enrichment of the target being 
tested for. The user then applies a reagent contained in the kit in order to detect the changed level or 
alteration in the diagnostic component. 

Each kit necessarily comprises the reagent which renders the procedure specific: a reagent 

25 polynucleotide, used for detecting target DNA or RNA; a reagent antibody, used for detecting target 
protein; or a reagent polypeptide, used for detecting target antibody that may be present in a sample to be 
analyzed. The reagent is supplied in a solid form or liquid buffer that is suitable for inventory storage, and 
later for exchange or addition into the reaction medium when the test is performed. Suitable packaging is 
provided. The kit may optionally provide additional components that are useful in the procedure. These 

30 optional components include buffers, capture reagents, developing reagents, labels, reacting surfaces, 
means for detection, control samples, instructions, and interpretive information. 

Other members of the RFHV/KSHV subfamily 

35 RFHV and KSHV are exemplary members of the RFHV/KSHV subfamily. This invention embodies 

polynucleotide sequences encoding DNA polymerase of other members of the subfamily, as defined 
herein. We anticipate that other members of the subfamily will be identified and characterized, including 
some that are capable of infecting primates, including humans. One such member is another virus 
infecting monkeys, designated RFHV2. A segment of the DNA polymerase encoding sequence for this 

40 virus was cloned from RF tissue obtained from a Macaca mulatta monkey, as described in Example 1 1 . 
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In order to identify and characterize other members of the family, reagents and methods of this 
invention are applied to DNA extracted from tissue samples suspected of being infected with such a virus. 
Suitable sources include biological samples obtained from a wide range of conditions occurring in humans 
and other vertebrates. Preferred are conditions in which the agent is suspected of being lymphotrophic, 
5 similar to other members of the gamma herpes virus subfamily; for example, infectious mononucleosis of 
non-EBV origin. More preferred are conditions which resemble in at least one of their clinical or 
histological features the conditions with which RFHV or KSHV are associated. These include: a) 
conditions in which fibroproliferation is part of the pathology of the disease, especially in association with 
collagen deposition, and especially where the fibrous tissue is disorganized; b) conditions involving 

10 vascular dysplasia; c) conditions involving malignant transformation, especially but not limited to cells of 
lymphocyte lineage; d) conditions for which an underlying immunodeficiency contributes to the frequency 
or severity of the disease; e) conditions which arise idiopathically at multiple sites in an organ or in the 
body as a whole; f) conditions which epidemiological data suggests are associated with an infectious or 
environmental agent. Conditions which fulfill more than one of these criteria are comparably more 

15 preferred. Some examples of especially preferred conditions include retroperitoneal fibrosis, nodular 
fibromatosis, pseudosarcomatous fibromatosis, fibrosarcomas, sclerosing mesenteritis, acute respiratory 
disease syndrome, idiopathic pulmonary fibrosis, diffuse proliferative glomerulonephritis of various types, 
gliomas, glioblastomas, gliosis, and all types of leukemias and lymphomas. 

The process of identification of members of the RFHV/KSHV subfamily preferably involves the use of 

20 the methods and reagents provided in this invention, either singularly or in combination. 

One method involves amplifying and/or characterizing a polynucleotide encoding a DNA polymerase 
in the sample. This can be performed, for example, by amplifying the polynucleotide in a reaction such as 
a PCR, using an RFHV/KSHV subfamily specific oligonucleotide, such as those listed in Table 6, as a 
primer in the reaction. The presence of amplified reaction product suggests polynucleotide in the sample 

25 derived from a member of the RFHV/KSHV subfamily. 

Members of the subfamily can also be identified by performing a hybridization assay on the 
polynucleotide of the sample, using a suitable probe. The polynucleotide to be tested may optionally be 
amplified before conducting the hybridization assay, such as by using an oligonucleotide listed in Table 4 
or Table 6 in a PCR. Preferred probes for the hybridization assay include the oligonucleotides of Table 6. 

30 Other preferred probes are fragments of 16 nucleotides or more of the polynucleotide encoding DNA 
polymerase from either RFHV or KSHV, preferably contained in SEQ. ID NO:1 or SEQ. ID NO:3. The 
hybridization reaction is performed under the least stringent conditions wherein the probe will not form a 
stable duplex with a polynucleotide comprising any of SEQ. ID NOS:23 to 29, but will form a stable duplex 
with a polynucleotide comprising SEQ. ID NO:1 or a polynucleotide comprising SEQ. ID NO:3, and 

35 preferably either one. Formation of a stable duplex with the test polynucleotide under these conditions 
suggests the presence of a polynucleotide in the sample derived from a member of the RFHV/KSHV 
subfamily. 

Members of the subfamily can also be identified by using a reagent antibody of a specificity that 
cross-reacts between antigens produced by members of the subfamily, but not with other antigens, 
40 including those produced by herpes viruses not members of the subfamily. Methods for producing such 
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antibodies were outlined in an earlier section. The test is performed, for example, by using the antibodies 

in an immunohistochemistry study of tissue sections prepared from individuals with the conditions listed 

above. Positive staining of a tissue section with the antibody suggests the presence of DNA polymerase in 

the sample from a member of the RFHV/KSHV subfamily, probably because the tissue is infected with the 
5 virus. Similarly, if antibodies cross-reactive with RFHV or KSHV antigens but not with other herpes virus 

antigens are found in the circulation of an individual, this suggests that the individual has been subject to a 

present or past infection with a member of the RFHV/KSHV subfamily. 

Once a member of the RFHV/KSHV subfamily is suspected in a biological sample, it is desirable to 

obtain a fragment of the DNA polymerase gene corresponding to nucleotides 330-501 of Figure 1 . The 
10 fragment is sequenced according to standard techniques to determine whether the virus is a bone fide 

member of the RFHV/KSHV subfamily, as defined herein. A preferred method of identifying members of 

the RFHV/KSHV subfamily is provided below in Examples 11 and 12. 

Once a new member of the RFHV/KSHV subfamily has been identified, other embodiments of this 

invention may be brought into play for purposes of detection, diagnosis, and pharmaceutical development. 
1 5 Changes to render them suitable for the new subfamily member, if required, are expected to be minor and 

will be obvious based on the new sequence data, or will be a matter of routine adjustment. 

Altered forms of DNA polymerase from the RFHV/KSHV subfamily 

20 This invention also embodies altered forms of DNA polymerase of the RFHV/KSHV subfamily. As 

described earlier, work with DNA polymerase from other herpes viruses has helped pinpoint active regions 
and residues of the molecule involved in substrate binding, polymerase activity, or drug resistance. Some 
of the residues described appear in conserved regions of the polymerase molecule, and are identical 
between RFHV, KSHV, and the virus in which they were originally described. By analogy, mutation of the 

25 same residue in the DNA polymerase of the RFHV/KSHV subfamily is expected to have a similar effect: 
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TABLE 9: Pos^^^ 

the RFHV/KSHV Subfamily 






llllllll 


Y->F 


8 


Reduction of DNA polymerase 
activity 


N — * Y 


I uo 




Y -> F or S 






G->D 


107 




Y->F 


168 




G->R 


169 




D -> G or N 


170 




T -> K or P 


171 




D -> A or G 


172 




A->V 


5 


Increased resistance to antiviral 
compounds 


S->N 


10 




P->T 


85 




T->M 


101 




R -> S 


130 





The numbering of the residues in Table 9 begins with the first amino acid encoded by the entire DNA 
polymerase polynucleotide fragment of KSHV shown in Figure 1 (i.e., the first amino acid of SEQ. ID 
NO:4). 

5 DNA polymerase activity is believed to be essential for replication of a herpes virus. Mutations shown 

in Table 9 that are expected to impair DNA polymerase activity may therefore be useful in creating 
attenuated forms of the respective virus. Other mutations may increase or decrease the resistance of the 
RFHV or KSHV polymerase to antiviral drugs. 

Herpes viruses, particularly attenuated forms, are useful in developing viral vectors for therapeutic 

10 purposes (Johnson et al., Ward et al.). One such use is in the development of polyvalent vaccines. It is 
desirable, especially in developing countries, to provide prophylactic vaccines capable of stimulating the 
immune system against several potential pathogens simultaneously. Viruses that are engineered to 
express immunogenic peptides of several different pathogens may accomplish this purpose. Herpes 
viruses may be especially suitable vectors, because the large genome may easily accommodate several 

1 5 kilobases of extra DNA encoding the peptides. Ideally, the viral vector is sufficiently intact to exhibit some 
biological activity and attract the attention of the host's immune system, while at the same time being 
sufficiently attenuated not to cause significant pathology. Thus, an attenuated virus of the RFHV/KSHV 
subfamily may be useful as a vaccine against like virulent forms, and may be modified to express 
additional peptides and extend the range of immune protection. 

20 Another use for attenuated forms of herpes viruses is as delivery vehicles for gene therapy 

(Latchman et al., Glorioso et al.). In order to be effective, polynucleotides in gene therapy must be 
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delivered to the target tissue site. In the treatment of fibrotic diseases, malignancies and related 
conditions, attenuated viral vectors of the RFHV/KSHV subfamily may be preferable over other targeting 
mechanisms, including other herpes viruses, since they have the means by which to target towards the 
affected tissues. In this embodiment, the virus is first attenuated, and then modified to contain the 
polynucleotide that is desired for gene therapy, such as those that are outlined in a previous section. 

The foregoing description provides, inter alia, a detailed explanation of how DNA polymerase 
encoding regions of herpes viruses can be identified and their sequences obtained. Polynucleotide 
sequences for regions of the DNA polymerase gene of RFHV and KSHV are provided. 

The polynucleotide sequences provided are believed to be an accurate rendition of the sequences 
contained in the polynucleotides from the herpes viruses in the tissue samples used for this study. 
However, it is recognized that sequences obtained by amplification methods such as PCR may comprise 
occasional errors in the sequence as a result of amplification. The error rate is estimated to be between 
about 0.44% and 0.75% for single determinations; about the same rate divided by V(n-1) for the consensus 
of n different determinations. Nevertheless, the error rate may be as high as 2% or more. Sequences free 
of amplification errors can be obtained by creating a library of herpes virus polynucleotide sequences, 
using oligonucleotides such as those provided in Table 7 to select relevant clones, and sequencing the 
DNA in the selected clones. The relevant methodology is well known to a practitioner of ordinary skill in 
the art: see, e.g., Example 9. 

It is recognized that allelic variants and escape mutants of herpes viruses occur. Polynucleotides and 
polypeptides may be isolated or derived that incorporate mutations, either naturally occurring, or 
accidentally or deliberately induced, without departing from the spirit of this invention. 

The examples presented below are provided as a further guide to a practitioner of ordinary skill in the 
art, and are not meant to be limiting in any way. 

Examples 

Example 1: Oligonucleotide primers for Herpes Virus DNA polymerase 

Amino acid sequences of known herpes virus DNA polymerases were obtained from the PIR protein 
database, or derived from DNA sequences obtained from the GenBank database. The sequences were 
aligned by computer-aided alignment programs and by hand. Several conserved regions were apparent. 
Three of the most highly conserved regions were chosen for design of amplification primers. The regions 
selected are indicated in Figure 2 as REGION 1 , REGION 2, and REGION 3. 

Having identified suitable conserved regions from the amino acid sequences, the DNA sequences for 
these regions were used to design the oligonucleotide primers. The primers were designed to have a 
degenerate segment of 12-14 base pairs at the 3' end, and a consensus segment of 18-30 bases at the 5' 
end. This provides primers with optimal sensitivity and specificity. 

The degenerate segment extended across the most highly conserved region of herpes virus DNA 
polymerase sequences, encompassing the least number of alternative codons. The primers could 
therefore be synthesized with alternative nucleotide residues at the degenerate positions and yield a 



WO 97/04105 



PCT/US96/11688 



minimum number of combinations. There were no more than 256 alternative forms for each of the primers 
derived. 

The consensus segment was derived from the corresponding flanking region of the DNA polymerase 
sequences. Generally, the consensus segment was derived by choosing the most frequently occurring 
5 nucleotide at each position of all the DNA polymerase sequences analyzed. However, selection was 
biased in favor of C or G nucleotides, to maximize the ability of the primers to form stable duplexes. 

Results are shown in Figures 3-5, and summarized in Table 4. Figure 3 shows DNA sequences of 
known herpes virus DNA polymerase genes near REGION 2. These sequences were used to design the 
oligonucleotides DFASA and DFQSA, as shown. In a PCR, these oligonucleotides would act as primers 
10 by hybridizing with the strand antisense to the coding strand, and initiating polymerization in the same 
direction as the DNA polymerase encoding sequence. Figure 4 shows DNA sequences near REGION 3, 
from which the oligonucleotides VYGA, VYGCA and VYGSQA were designed for initiating polymerization 
in the 5' direction. Figure 5 shows DNA sequences near REGION 3, from which the oligonucleotides 
GDTD1 B and GDTDSQB were designed. These oligonucleotides would hybridize with the coding strand 
1 5 and initiate polymerization in the direction opposite to that of the DNA polymerase encoding sequence. 

Synthetic oligonucleotides according to the designed sequences were ordered and obtained from 
Oligos Etc, Inc. 

Example 2: DNA extraction 

20 

Biopsy specimens were obtained from Kaposi's sarcoma lesions from human subjects diagnosed 
with AIDS. Specimens were also obtained from retroperitoneal fibromatosis lesions in a colony of Macaca 
nemestrina, Macaca fascicularis, and Macaca fuscata at the University of Washington Regional Primate 
Research Center. 

25 The specimens were fixed in paraformaldehyde and embedded in paraffin, which were processed for 

normal histological examination. 

Fragments of the paraffin samples were extracted with 500 ^iL of xylene in a 1.5 mL EPPENDORF™ 
conical centrifuge tube. The samples were rocked gently for 5 min at room temperature, and the tubes 
were centrifuged in an EPPENDORF™ bench-top centrifuge at 14,000 rpm for 5 min. After removing the 

30 xylene with a Pasteur pipette, 500 pL of 95% ethanol was added, the sample was resuspended, and then 
re-centrifuged. The ethanol was removed, and the wash step was repeated. Samples were then air-dried 
for about 1 hour. 500 of proteinase-K buffer (0.5% TWEEN™ 20, a detergent; 50 mM Tris buffer pH 
7.5, 50 mM NaCI) and 5 of proteinase K (20 mg/mL) were added, and the sample was incubated for 3 h 
at 55°C. The proteinase K was inactivated by incubating at 95°C for 10 min. 

35 

Example 3: Obtaining amplified segments of RFHV and KSHV DNA polymerase 

The oligonucleotides obtained in Example 1 were used to amplify DNA extracted in Example 2, 
according to the following protocol. 
40 A first PCR reaction was conducted using 1 of DNA template, 1 \iL of oligonucleotide DFASA (50 

pmol/^L), 1 jul_ of oligonucleotide GDTD1B (50 pmol/^iL), 10 jiL of 10 x WB4 buffer (0.67 M Tris buffer pH 
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8.8, 40 mM MgCI 2 , 0.16 M (NH 4 ) 2 S0 4l 0.1 M p-mercaptoethanol, 1 mg/mL bovine serum albumin), 1 nL 
containing 2.5 mM of each of the deoxyribonucleotide triphosphates (dNTPs), 66 jiL distilled water, and 50 
mineral oil. The mixture was heated to 75°C in a Perkin-Elmer (model 480) PCR machine. 0.5 nL Taq 
polymerase (BRL, 5 UfyL) and 19.5 jj.L water was then added. 35 cycles of amplification were conducted 
5 in the following sequence: 1 min at 94°C, 1 min at 60°C, and 1 min at 72°C. 

A second PCR reaction was conducted as follows: to 1 yL of the reaction mixture from the previous 
step was added 10 \il 9-10 x WB4 buffer, 1 \xl dNTPs, 0.5 nL Taq polymerase, 86.5 \iL water, and 50 \iL 
mineral oil. The mixture was heated to 75°C, and 1 jllL each of oligonucleotide VYGA (50 pmol/jiL) and 
oligonucleotide GDTD1 B (50 pmol/^iL) was added. 35 cycles of amplification were conducted as before. 

10 

Example 4: Sequence of the 236 base fragment 

An aliquot of the final amplification mixture of each of the samples was purified by electrophoresis on 
a 2% agarose gel. Of 9 M. nemestrina and 1 M. fasicularis samples used, 4 M. nemestrina samples 

1 5 yielded amplification product. Amplification product was also obtained from human samples. The agarose 
gel was stained with ethidium bromide and the DNA was visualized using U.V. light. Bands of the correct 
size were eluted onto DEAE paper. Each extracted polynucleotide was ligated to a PGEM-T™ vector and 
transformed into competent bacteria (E. coli JM-109). Bacterial clones containing the amplified DNA were 
picked and cultured. The bacteria were lysed and the DNA was extracted using phenol-chloroform 

20 followed by precipitation with ethanol. Sequencing was performed by the Sanger & Nicholson 
dideoxynucleotide method, using M13 forward and reverse primers. 

The length of the fragment in between the primer hybridizing regions was 172 base pairs in length. 
For the four M. nemestrina samples used, all yielded identical sequence data. About 70% of the residues 
are identical in the fragment between RFHV and KSHV. Compared with the most closely related known 

25 sequences of the herpes virus family, differences in between sequences are distributed along the entire 
length of this fragment. The longest stretch of consecutive nucleotides that is identical between any two 
sequences in this fragment is 1 1 . 

The polypeptide encoded in this fragment is 81% identical between RFHV and KSHV, of which the 
first 24 residues are 100% identical, and the first 31 are 97% identical. The longest stretch of consecutive 

30 amino acids that is identical between RFHV or KSHV and any of the other known herpes virus DNA 
polymerases in this fragment is 10. 

Example 5: RFHV and KSHV specific amplification assays 

35 Four oligonucleotides were prepared based on the sequence of the polynucleotide fragment of the 

RFHV and KSHV DNA polymerase for use in nested virus-specific amplification reactions. Primers 
VASGA, ILPCA, PIEAB and PEARB were based on the RFHV sequence; primers SGILA, CLNIA, IEASB, 
and EARFB were based on the KSHV sequence (T able 7). The RFHV primers were used to amplify DNA 
samples obtained from the PBL of macaque monkeys as follows: 

40 Uncoagulated whole blood samples were collected from 20 M. nemestrina born in the colony at the 

University of Washington. 30 blood samples were obtained from wild-caught M. nemestrina. None of the 
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animals had overt symptoms of fibromatosis. Plasma and blood cells were separated by centrifugation. 
Peripheral blood mononuclear cells (PBMC) were prepared by centrifuging the cells through a density 
gradient, according to standard blood separation techniques. DNA was extracted from the cells according 
to the method of Example 2. The DNA was then amplified, first using primers VASGA and PEARB, then 
5 using primers ILPCA and PIEAB. The conditions of the amplification were similar to that of Example 3. 
The reaction product was run on an agarose gel, stained with ethidium bromide, and examined under U.V. 
light. 

When the assay was performed in duplicate and under conditions to avoid cross-contamination of 
PCR reaction products, none of the RF symptom-free monkeys were found to have detectable levels of 
1 0 RFHV polynucleotide encoding DNA polymerase in their peripheral blood by this assay. 

PBMC may also be examined by immunohistology techniques to confirm correlation between positive 
PCR products and RFHV antigenemia. PBMC are coated onto microscope slides, and fixed with a mixture 
of 50% methanol, 20% acetone and 30% water. They are overlaid with a primary serum, washed, overlaid 
with FITC-(rabbit anti-monkey IgG) (Nordic Labs), washed again, and then examined by fluorescence 
1 5 microscopy. 

Antibody-containing serum may be obtained from a monkey giving a positive RFHV amplification 
assay result, or an animal immunized with RFHV, or an RFHV extract. Serum from a monkey with a 
negative result may be used as control. PBMC from animals giving a positive result in the amplification 
test will also give a positive immunohistology result due to antigenemia of an RFHV antigen component. 
20 To conduct an amplification assay for KSHV, DNA is extracted from tissue suspected of harboring the 

virus; particularly biopsy samples from human subjects with Kaposi's Sarcoma lesions and body cavity 
B-cell lymphoma. The DNA is amplified in two stages, using primers SGILA and EARFB in the first stage, 
and CLNIA and IEASB in the second stage. As before, a positive result is indicated by the presence of 
abundant polynucleotide in the reaction product, as detected by ethidium bromide staining. 

25 

Example 6: Upstream sequence of the RFHV and KSHV DNA polymerase 

DNA from Kaposi's Sarcoma tissue similar to that used in Example 3 was used in additional 
amplification reactions to obtain a longer fragment of the gene encoding KSHV DNA polymerase. The 

30 oligonucleotides DFASA and GDTD1B were used to prime a first-stage amplification reaction, as in 
Example 3, and the reaction product was separated on an agarose gel. The size of the fragment from 
DFASA to GDTD1B (now known to be 536 bases long) was estimated from the known sHV1 and EBV 
sequences, and a corresponding band was recovered from the gel. The extracted polynucleotide was 
subjected to a second round of amplification using the same primers. The product was cloned into E. coli 

35 as in Example 4. 

Clones containing suitable inserts were identified from three different amplifications of the DNA 
extracted from the tissue. The clone inserts were sequenced from both ends using vector-specific 
oligonucleotides (M13 forward and reverse primers). About 160 nucleotides from the 5' end (including the 
DFASA hybridizing region) and about 233 nucleotides from the 3' end (including the GDTD1B hybridizing 
40 region) were sequenced for all three amplifications. The centermost portion of the fragment was 
sequenced in one of the three amplifications. 
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A consensus sequence for the fragment was obtained by combining results of the three 
determinations with the results of Example 4, as appropriate. The data are shown in Figure 1, in 
comparison with the sequence determined for the RFHV DNA polymerase fragment in Example 4. 
Numbering of both sequences begins at the first position of primer DFASA. 
5 Regions of each sequence corresponding to hybridization sites for DFASA and GDTD1B may not be 

accurate reflections of the target sequence. The fragment between the primers is believed to represent the 
DNA from which the polynucleotide used for sequencing was amplified. However, occasional errors may 
have been introduced during the amplification. Assuming the consensus sequence of KSHV to be an 
accurate reflection of the sequence of the DNA extracted from the tissue, there was about a 0.75% error 

1 0 rate in the sequence of each amplified product in the nucleotides towards the 5' end, and about a 0.44% 
error rate in the sequence towards the 3' end, not including the region hybridizing with the primers. 

To obtain the corresponding RFHV polynucleotide sequence, DNA from frozen RF tissue of a 
macaque monkey was first amplified using the broad specificity DNA polymerase primer DFASA in 
conjunction with the RFHV specific primer PEARB, and then by DFASA in conjunction with the RFHV 

1 5 specific primer PIEAB. 

The procedure was as follows: 5 jil_ of DNA template was mixed with 1 \iL of each of the primers (50 
pmol/^L), 10 \il of 10 x WB4 buffer, 1 \iL 2.5 mM dNTP, 59-65 water, and 60 mineral oil. The 
temperature was raised to 60°C, and Taq polymerase (0.5 \il diluted to 20 \iL in water) was added. The 
DNA was amplified for 35 cycles of 94°C for 1 min, 55°C for 1 min, and 72°C for 1 min. 2 \xL of the 

20 amplified product was added to 10 \iL 10 x WB4 buffer, 1 \iL 2.5 mM dNTP, 66.5 yL water, 0.5 ^iL Taq 
polymerase, and 60 mineral oil. The temperature was raised to 60°C, and then a mixture of 1 pL PIEAB 
(50 pmol/fiL), 2 jiL DFASA (50 pmol/^iL), and 18 \iL of water was added. Amplification cycles were 
conducted as before. Finally, a third round of amplification was performed to introduce a radiolabel. 
Oligonucleotide PIEAB was end-labeled with gamma 32 P-ATP, and 1 \iL was added to 20 \iL of the 

25 reaction mixture from the previous amplification step, along with 1 \iL 2.5 mM dNTP and 1 ^iL Taq 
polymerase. Amplification was conducted through five cycles of 94°C, 55°C and 72°C, as before. 

An aliquot of the radiolabeled reaction product was electrophoresed on a 6% polyacrylamide 
sequencing gel. A band of the correct size (predicted by analogy with the KSHV sequence) was identified 
by autoradiography, and cut out of the dried gel. DNA was eluted by incubation in 50 water. A further 

30 amplification reaction was performed using 2 \iL of eluted DNA, 10 j^L 10 x WB4 butter, 1 ^iL 2.5 mM 
dNTP, 1 \xL PIEAB (50 pmol/^L), 1 \iL DFASA (50 pmol/^L), 0.5 nL Taq polymerase, and 84.5 jiL water. 
Amplification was conducted through 35 cycles of 95°C for 30 sec, 60°C for 30 sec, and 72°C for 65 sec. 
The amplified product was isolated using a QUIAEX™ gel extraction kit, and the DNA was cloned into 
pGEM™-t vector. JM-109 cells were transformed with the DNA, and colonies containing inserts were 

35 isolated. Colonies containing inserts of the correct size were used to obtain DNA for sequencing. 

Data from these experiments were combined with that from Example 4 to provide the sequence of 
536 base pairs corresponding to the RFHV and KSHV DNA polymerase gene. Omitting the outermost 
primer-hybridizing regions, 475 base pairs of each sequence have been determined for both RFHV and 
KSHV. These sequences are listed in Figure 6, in comparison with the corresponding region of the DNA 

40 polymerase gene from other sequenced gamma herpes viruses. The longest region that is identical 
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between the RFHV sequence and any of the other viruses is a first 20 base pair subfragment (SEQ. ID 
NO:1 1 0) and a second 20 base pair fragment (SEQ. ID NO:1 1 1) shared with eHV2. 

Figure 7 shows the corresponding encoded polypeptide sequences. There is a linear sequence of 
about 31 residues near the middle of SEQ. ID NO:2 shared between the DNA polymerase of RFHV and 
5 eHV2. This shared sequence is listed separately in SEQ. ID NO:112. A sequence of 26 amino acids is 
shared in the same area between RFHV and sHV1 , and two sequences of 12 amino acids shared between 
RFHV and EBV. These areas of homology map near conserved REGION 3 of the other herpes virus DNA 
polymerase sequences (Figure 2). A second shared sequence occurs near the beginning of SEQ. ID 
NO:4 between KSHV and other gamma herpes viruses. This sequence maps near conserved REGION 2 
10 of other herpes virus DNA polymerase sequences. This sequence fragment shared between KSHV and 
other gamma herpes viruses is listed separately in SEQ. ID NO:1 13. 

Figure 8 provides a comparison of the protein sequence across the spectrum of different herpes 
viruses corresponding to the sequence encoded by the 475 base pair sequence obtained herein for RFHV 
and KSHV. 

15 The degree of identity between sequences can be used to construct a relationship map between 

DNA polymerases, as shown in Figure 9. The relationship between the species may reflect the relative 
ancestral relationship between the polypeptides, and between the organisms that encode them. Based on 
this analysis, RFHV and KSHV are provisionally assigned to the gamma subfamily of herpes viruses, 
which also includes eHV2, sHV1 and EBV. Other viruses of the RFHV/KSHV subfamily would be 

20 assignable to the herpes virus gamma subfamily on this basis. 

Example 7: Oligonucleotide primers and probes for the RFHV/KSHV subfamily 

Based on the sequence of the 475 base pair polynucleotide fragment obtained for RFHV and KSHV, 
25 five oligonucleotides were designed that could be used either as PCR primers or as hybridization probes 
with members of the RFHV/KSHV subfamily. These oligonucleotides were designated LSGGA, CTDPA, 
PCLNA, KMLEA, and GISPA. 

These oligonucleotides are shown in Figure 10, alongside the sequences they were derived from. 
Like the oligonucleotides of Example 1, they have a consensus segment towards the 5' end, and a 
30 degenerate segment towards the 3' end. However, these oligonucleotides are based only on the RFHV 
and KSHV sequences, and will therefore preferentially form stable duplexes with DNA polymerase of the 
RFHV/KSHV subfamily. Under hybridization conditions that permit them to form stable duplexes with the 
RFHV or KSHV encoding polynucleotide fragment, they are expected to form stable duplexes with more 
members of the RFHV/KSHV subfamily than would equal-length polynucleotides of the RFHV or KSHV 
35 sequence, either alone or in combination. 

Both oligonucleotides are oriented in the same direction. In a PCR amplification reaction, one or the 
other of these oligonucleotides may be used as primers in combination with a primer with the opposite 
orientation, such as GDTD1B. 
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Example 8: Antigenic and immunogenic regions ofRFHVand KSHVDNA polymerase 

Based on the 475 base pair polynucleotide sequence of the RFHV and KSHV DNA polymerase 
encoding region, it is possible to predict what sites on the protein unique for each virus, and therefore 
5 constitute potential sites for the binding of virus-specific antibodies. 

Figure 7 shows example peptides of 6 or 7 amino acids in length. Some of the peptides comprise 
one or more residues that are distinct either for RFHV or KSHV (Class III), or for the RFHV/KSHV 
subfamily (Class II) compared with the corresponding gamma herpes virus peptides. These peptides were 
listed earlier in Table 8. The numbering of the amino acid residues in both Figure 7 and Table 8 begins 
10 with the first amino acid coded after the hybridization site of the VYGA primer (nucleotide position 331 of 
Figure 1). 

To confirm that regions contained within this 57-amino acid region of the DNA polymerase may be 
recognized by antibody, computer analysis was performed to generate Hopp and Woods antigenicity plots. 
The Hopp and Woods determination is based in part on the relative hydrophilicity and hydrophobicity of 

1 5 consecutive amino acid residues (Hopp et al). 

Results are shown in Figure 11 and Figure 12. The numbering of RFHV begins with the first amino 
acid coded after the VYGA primer (as in Figure 7). The numbering of the KSHV polypeptide residues in 
Figure 12 begins with the first amino acid coded after the hybridization site of the DFASA primer 
(nucleotide position 28 of Figure 1). 

20 Both RFHV and KSHV contain several regions predicted to be likely antibody target sites. For 

example, the RFHV shows several hydrophobic and antigenic patches along the amino acid sequence. 
KSHV shows hydrophobic patches beginning at residues 26, 44, 52, 121 and 151; and antigenic patches 
beginning at residues 8, 37, 45, and 94. The peptides of Figure 7 that correspond to some of these 
regions may be especially antigenic. 

25 

Example 9: Sequencing the complete RFHV and KSHVDNA polymerase coding region 

Additional sequence data for the KSHV DNA polymerase encoding region has been obtained to the 
5' and 3* direction of the segment described in Example 6. 
30 Two Kaposi's sarcoma samples, designated K-12 and K-15, were used to prepare DNA according to 

the method of Example 2. 

Additional Type 1 oligonucleotide primers were designed to hybridize with herpes virus DNA 
polymerase nucleic acid sequences flanking the KSHV sequence already obtained. Examples are shown 
in Table 10: 
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TABLE 10: Additional Type 1 Oligonucleotic 
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137 


YFDKB 


TTAGCTACTCCGTGGAGCAGY 
TTRTC RAARTA 


32 


16 


Herpes DNA 
polymerase 
(especially 
gamma) 


3'->5' 


138 















PCR amplification was conducted as follows: 100 ng of DNA from each sample was first amplified in 
50 pL total reaction buffer under the following conditions: 1 x PCR buffer (67 mM Tris buffer pH 8.8, 16 
mM (NH4)2S04, 10 mM p-mercaptoethanol, 0.1 mg/mL bovine serum albumin), 2 mM MgCl2, 50 pmol of 
5 each oligonucleotide CVN1A and FDIEC1B, 100 jiM (each) dATP, dCTP, dGTP, dTTP, 1.25 units Taq 
DNA polymerase (AMPLITAQ™, Perkin-Elmer Cetus). Amplification was conducted through 45 cycles of 
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95°C for 30 sec; 50°C for 30 sec, and 72°C for 30 sec. PCR products were electrophoresed on a 2% 
agarose gel and visualized by ethidium bromide staining. 

PCR products were purified using QIAQUICK SPIN™ PCR purification kit (Qiagen, Chatsworth CA). 
Products were cloned into PT7BLUE® Vectors (Novagen, Madison Wl). Plasmids were purified using 
5 QUIAGEN SPIN™ plasmid miniprep kit. Purified plasmids were sequenced using ABI automated 
sequencing methodology, using M13 forward and reverse primers. Five clones were sequenced from 
each of K-12andK-15. 

Two body cavity lymphoma cell lines, designated BC-1 and BC-2, were used as DNA sources for 
determination of downstream sequence. 5 x 10 5 cells from each line were washed in PBS and pelleted 
10 separately. Proteinase-K buffer was added to each pellet and incubated at 65°C for 1 h. DNA was 
extracted twice with 1:1 (vohvol) phenolxhloroform, precipitated and washed in ethanol, and resuspended 
in 10 mM Tris buffer pH 8.0. 

Approximately 0.5 \xg total genomic DNA from BC-1 and BC-2 cell lines was used with 25 pmol of 
oligonucleotide primers CVNVA and EARFB, 2.5 units Taq DNA polymerase (Boehringer-Mannheim), 250 
15 fiM dNTP, and 4 mM MgCI 2 in a total volume of 100 \il of 1 x PCR buffer. PCR amplification was 
conducted using a "hot start" at 70°C for 1 min prior to adding the Taq polymerase, and conducted through 
35 cycles of 94°C for 45 sec; 60 °C for 45 sec, and 72 °C for 90 sec. PCR products were electrophoresed 
on a 2% agarose gel and visualized by ethidium bromide staining. 

PCR products were purified and cloned into PT7BLUE® vectors as before. Purified plasmids were 
20 sequenced using ABI automated sequencing methodology using KSHV sequence specific primers 
RDSWA, FDCSA, YSTLB, and DYETB. DNA sequences were analyzed using the GenePro algorithm for 
single alignments open reading frames. The ClustalW algorithm was used for determining consensus 
sequences for multiple alignments. 

The nucleotide sequence obtained is shown in Figure 13 (SEQ. ID NO:116) along with the encoded 
25 amino acid sequence (SEQ. ID NO:117). In total, 2511 nucleotides are shown, of which the first 35 
correspond to the CVNVA primer, and the last 12 correspond to the YFDKB primer. Bases 36 to 2499 of 
SEQ. ID NO:116, corresponding to amino acids 13 to 833 of SEQ. ID NO:117, represent the KSHV DNA 
polymerase sequence. 

Alignment of part of the KSHV DNA polymerase amino acid sequence with other herpes viruses is 
30 shown in Figure 14. Residues marked with an asterisk (*) are identical amongst ail the sequences shown. 
Residues marked with a bullet (•) represent conservative amino acid substitutions. Residues marked with 
an arrow (t) are of interest, because they are conserved between other herpes viruses but are different in 
KSHV. One of these is a histidine in KSHV in the position of an aspartic acid in other viruses, which is a 
non-conservative difference. Residues marked with an arrow may be suitable targets for antibodies or 
35 drugs that are specific for KSHV or for the RFHV/KSHV subfamily. 

Amongst four KSHV DNA polymerase nucleotide sequences obtained, variations were noted in four 
positions. These are believed to represent naturally occurring allelic variants. At about nucleotide 319, the 
sequence TTCTCG was alternatively found as TTTTCG, which is a silent variation (not affecting the 
encoded protein sequence). At about nucleotide 348, the sequence AACCCG was alternatively found as 
40 AATCCG, which is also a silent mutation. At about nucleotide 1795, the sequence CCAGTA was 
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alternatively found to be CCAATA, which represents a change of the encoded peptide from -Pro-Val- to - 
Pro-lie. At about nucleotide 1822, the sequence TTCAAG was alternatively found to be TTCAGG, which 
represents a change of the encoded peptide from -Phe-Lys- to -Phe-Arg-. Alignment of the KSHV amino 
acid sequence variants with DNA polymerase sequences of other herpes viruses is shown in Figure 1 5. 

Comparison of the KSHV DNA polymerase nucleotide sequence with that of other herpes viruses led 
to design of additional Type 3 (virus-specific) oligonucleotides, listed in Table 1 1 : 
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•i|6rlen^:' : . ; :: 
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TTGTGCGCTTGGATGATACT 
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21 


1 


KSHV DNA 
polymerase 


3'->5' 


139 


HVLQB 


GAGGGCCTGCTGGAGGACG 
TG 


21 


1 




3'->5' 


140 


";sggfb 


CGGTGGAGAAGCCGCAGGA 

i \j 


21 


1 




3'->5* 


141 




ACCTCCCGCACCTGACCGTG 
T 


21 


1 




5'-»3' 


142 




AAGCTAGACAGGAGGAGCTT 
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21 


1 




5'->3' 


143 




ACTTGAATTATCTTGACGAAC 


21 


1 




3'->5' 


144 




ACGACAAGGTTCTGATGAAG 
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21 


1 




5^3' 


145 


RDSVVA ; i 


AGAGACTCTTGGACGGAACT 
G 


21 


1 


KSHV DNA 
polymerase 


5'->3' 


146 


: FDGSA J 


AGTTTGACTGCAGCTGGGAG 
G 


21 


1 




5'->3' 


147 


. YSTIB 


CGGGTATCAGTGTGGAGTAG 

• 

C 


21 


1 




3'->5' 


148 


DYETB 


GAGGACAAAGGTTTCGTAGT 
C 


21 


1 




3'->5' 


149 



Based upon other gamma herpes viruses, the KSHV DNA polymerase sequence is predicted to 
comprise a total of about 3000 base pairs, with additional nucleotides in both the 5' and 3' direction of the 
sequence shown. The remaining sequence may be determined by conducting the approach described on 
samples of affected tissue, using Type 1 oligonucleotides to sequence in from genes flanking the DNA 
polymerase in both the upstream and downstream direction. 

Alternatively, complete DNA polymerase sequences may be obtained by generating DNA libraries 
from affected tissue. For the RFHV sequence, libraries are prepared from macaque monkey PBMC known 
from the amplification assay of Example 5 to contain RFHV DNA. For the KSHV sequence, libraries are 
prepared from Kaposi's sarcoma lesions or B cell body cavity lymphoma. 

The DNA lysate is digested with proteinase K, and DNA is extracted using phenol-chloroform. After 
extensive dialysis, the preparation is partially digested with the Sau3A I restriction endonuclease. The 
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15 



-67- 



WO 97/04105 



PCT/US96/11688 



digest is centrifuged on a sucrose gradient, and fragments of about 10-23 kilobases are recovered. The 
lambda DASH-2™ vector phage (Stratagene) is prepared by cutting with BamHI. The size-selected 
fragments are then mixed with the vector and ligated using DNA ligase. 

The ligated vector is prepared with the packaging extract from Stratagene according to 
5 manufacturer's directions. It is used to infect XL1-BLUE™ MRA bacteria. About 200,000 of the 
phage-infected bacteria are plated onto agar at a density of about 20,000 per plate. After culturing, the 
plates are overlaid with nitrocellulose, and the nitrocellulose is cut into fragments. Phage are eluted from 
the fragments and their DNA are subjected to an amplification reaction using appropriate virus-specific 
primers. The reaction products are run on an agarose gel, and stained with ethidium bromide. Phage are 

10 recovered from regions of the plate giving amplified DNA of the expected size. The recovered phage are 
used to infect new XL1 bacteria and re-plated in fresh cultures. The process is repeated until single clones 
are obtained at limiting dilution. 

Each clone selected by this procedure is then mapped using restriction nucleases to ascertain the 
size of the fragment incorporated. Inserts sufficiently large to incorporate the entire DNA polymerase 

1 5 sequence are sequenced at both ends using vector-specific primers. Sequences are compared with the 
known polynucleotide sequence of the entire EBV genome to determine whether the fragment spans the 
intact DNA polymerase sequence. DNA is obtained from suitable clones, sheared, and sequenced by 
shot-gun cloning according to standard techniques. 

20 Example 10: Identifying immunogenic sites 

To identify what antibodies may be generated during the natural course of infection with RFHV, serial 
serum samples are obtained from 10-20 macaque monkeys giving a positive result in an RFHV DNA 
polymerase amplification test with PBMC, as in Example 5. To test for antibodies against KSHV, serum 

25 samples are obtained from 10-20 AIDS subjects with Kaposi's Sarcoma lesions, from 10-20 HIV-positive 
symptom-negative subjects, and 10-20 HIV-negative controls. In initial studies, sera in each population 
are pooled for antibody analysis. 

Peptides 12 residues long are synthesized according to the RFHV or KSHV sequence, as 
appropriate. Sequential peptides are prepared covering the entire sequence, and overlapping by 8 

30 residues. The peptides are prepared on a nylon membrane support by standard F-Moc chemistry, using a 
SPOTS™ kit from Genosys according to manufacturer's directions. Prepared membranes are overlaid 
with the serum, washed, and overlaid with beta-galactose conjugated anti-monkey IgG or anti-human IgG, 
as appropriate. The test is developed by adding the substrate X-gal. Positive staining indicates IgG 
antibody reactivity in the serum against the corresponding peptide. 

35 

Example 11: Obtaining other DNA polymerase sequences of the RFHV/KSHV subfamily 

A DNA polymerase encoding sequence from a third member of the RFHV/KSHV herpes virus 
subfamily was obtained as follows. 
40 DNA was extracted from two frozen tissue samples from a Macaca mulatta monkey with 

retroperitoneal fibromatosis. Extraction was conducted according to Example 1 . The extracted DNA was 
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precipitated with ethanol in the presence of 40 \ig glycogen as carrier, washed in 70% ethanol, and 
resuspended in 10 mM Tris buffer, pH 8.0. 

A 151 base pair fragment of a DNA polymerase encoding sequence was amplified using a triple- 
nested PCR: 1 00 ng of DNA from each sample was first amplified in 1 00 nL total reaction buffer under the 
5 following conditions: 1 x PCR buffer (67 mM Tris buffer pH 8.8, 16 mM (NH4) 2 S0 4 , 10 mM p- 
mercaptoethanol, 0.1 mg/mL bovine serum albumin), 4mM MgCI 2 , 25 pmol of each oligonucleotide VYGA 
and VYGCA, and 50 pmol of oligonucleotide GDTD1B, 25 \M (each) dATP, dCTP, dGTP, dTTP, 2.5 units 
Taq DNA polymerase (Boehringer-Mannheim). Amplification was conducted using a "hot start" at 70°C for 
1 min prior to adding the Taq polymerase, and conducted through 42 cycles of 94°C for 30 sec; 60 °C for 
10 30 sec, and 72 °C for 30 sec. A second amplification used 2 1± of the primary PCR product as template in 
50 \il reaction volume as before, except that 25 pmol of each oligonucleotide PCLNA and GDTDSQB and 
1 .25 units Taq polymerase were used. Amplification was conducted using a "hot start" and 35 cycles of 
the same conditions as before. A third amplification used 2 \xl of the secondary PCR product as template 
in 50 nL reaction volume as before, except that 25 pmol of each oligonucleotide KMLEA and GDTDSQB 
15 and 1.25 units Tag polymerase were used. Amplification was conducted using a "hot start" and 35 cycles 
of 94°C for 30 sec; 65 °C for 30 sec, and 72 °C for 30 sec. 

The final PCR product was electrophoresed on a 3% agarose gel, and visualized by ethidium 
bromide staining. PCR products were purified using QIAQUICK SPIN™ PCR purification kit (Qiagen, 
Chatsworth CA). Products were cloned into PC7BLUE® vectors (Novagen, Madison Wl). Plasmids were 
20 purified using QUIAGEN SPIN™ plasmid miniprep kit. Purified plasmids were sequenced using M13 
forward and reverse primers with the USB Sequenase 7-deaza-dGTP kit. 

Based on the sequence of the 151 base pair fragment, two sequence-specific (Type 3) 
oligonucleotides were designed, designated KVIYB and ASPDB. These were used in a nested PCR 
amplification with the Type 1 oligonucleotide QAHNA to obtain a 468 base pair fragment. The first 
25 amplification was conducted by using -1 ng of each DNA sample in 1 00 \iL PCR mixture as before, except 
that 50 pmol of each of QAHNA and KVIYB were used as primers. Amplification was conducted using a 
"hot start" and 35 cycles of 94°C for 30 sec; 55 °C for 60 sec, and 72 °C for 60 sec. The second 
amplification was conducted using 3 primary PCR product as template in 100 jiL reaction volume and 
50 pmol of QAHNA and ASPDB as primers. Amplification was conducted using a "hot start" and 40 cycles 
30 of 94°C for 30 sec; 60 °C for 60 sec, and 72 °C for 60 sec. PCR products were then electrophoresed on a 
2.5% agarose gel, and visualized by ethidium bromide staining. PCR products were cloned into 
PC7BLUE® vectors and sequenced as before. 

The nucleotide sequence obtained is shown in Figure 16, shown with the designation "RFMm" (SEQ. 
ID NO:118). This corresponds to a DNA polymerase encoding sequence referred to elsewhere in this 
35 application as "RFHVMm" or "RFHV2". The encoded protein sequence is shown in Figure 17 (SEQ. ID 
NO:119). 

Identity analysis with DNA polymerase sequences of RFHVMm with other herpes viruses is shown in 
Table 12: 
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Bases 1-454 
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— 










lllill^llill 


llllll 


63% (66%) 
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59% (64%) 
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57% (62%) 


54% (63%) 


62% (68%) 










HSV1 ||:| 


36 


52% (43%) 


53% (46%) 


53% (46%) 


llllllii^ 


llllll 


53% (44%) 


53% (46%) 


53% (46%) 






42% (41%) 


45% (43%) 


48% (45%) 










h'GMV 


33 


45% (38%) 


53% (41%) 


49% (40%) 


HHV6 


42 


44% (38%) 


46% (41%) 


48% (41%) 



Percent identity shown at the polynucleotide level and the (amin o acid level) 



Phylogenetic studies were performed using distance matrices, neighbor joining and bootstrap 
analysis, as implemented in the PHYLIP analysis package. Figure 18 shows the results of the bootstrap 
analysis, with the numbers indicated being the score out of 100 supporting the branch points shown. This 
5 analysis strongly supports a branch point that separates the RFHV/KSHV subfamily from other gamma 
herpes viruses. RFHVMm and RFHVMn are more closely related to each other than either are to KSHV. 

Sequences were also analyzed for G+C content. Results are shown in Table 13. The percentage of 
G+C across the region corresponding to the 454 bp of RFHVMm is shown, as calculated using GenePro 
software (Riverside Scientific). Values in parenthesis are G+C content calculated for the entire DNA 
10 polymerase sequence, where known. Also shown are CpG ratios, which is the ratio of observed:expected 
frequencies of CpG, taking into consideration the monomucleotide composition. 
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TABLE 13: G+C mononucleotide and CpG dinucleotide frequencies in the DNA 






IlllllllPlil^ 




RFHVMh 




55.9% 


0.96 


RFHVMm 




51.1% 


1.13 


S|| kshv 1 




54.4% 


0.91 






64.8% (63.6%) 


0.71 (0.75) 






39.9% (34.9%) 


0.28 (0.33) 


EBV 




64.4% (61.8%) 


0.73 (0.70) 






67.6% (65.8%) 


0.99 (1.06) 


vzv 




41.5% (42.2%) 


1.37 (1.15) 


hCMV 




60.2% (59.9%) 


1.27 (1.23) 






46.0% (40.9%) 


1.00 (1.12) 



The G+C frequences of the KS and RF sequences are quite similar to each other, falling midway 
between the high G-C content of EBV and eHV2 and the low G+C content of sHV1 . The CpG dinucleotide 
frequences of KS and RF are quite similar, close to the expected value (1.00) based on their 
5 mononucleotide compositions. These values are closer to those for alpha and beta herpes viruses, than 
for gamma herpes viruses outside the RFHV/KSHV subfamily. The CpG data suggest that RFHVMn, 
RFHVMm and KSHV genomes remain latent in non-dividing cells, in contrast to sHV1 and EBV which are 
latent in proliferating lymphoblastoid cells. 

The phylogenetic analysis, the CpG analysis, and the similarity between symptoms caused by the 
1 0 three viruses support the use of the monkey viruses as models for KSHV, and other members of the 
RFHV/KSHV subfamily that may infect humans. 

Examples of RFHVMm-specific Type 3 oligonucleotides is shown in Table 14: 



TABLE 14: Type 3 Oligonucleotides Specific for Polynucleotides Encoding DNA Polymerase from 


nation 


Sequence 


Length 


forms 


:: ^'^Target:^:': : J.\ ' 


Orien- 
tation 


SEQ 
ID: 


LCYSA 


CTATGTTACTCTACCCTGATT 


21 


1 


RFHVMm DNA 
Polymerase 


5'->3' 


150 


KVIYB 


GTATATCTCTTTAAACCTGGC 


21 


1 




3*->5* 


151 


ASPDB 


AACCTGGCGTCCGGGGAAG 
CG 


21 


1 




3'->5' 


152 



15 Figure 19 is a map showing the approximate relative positions for hybridization of certain 

oligonucleotides of this invention along the DNA polymerase encoding sequence. Numbering of 
nucleotide residues is approximate, and based on a starting position in the Glycoprotein B encoding 
region, which flanks the DNA polymerase encoding region in the upstream direction. Following each 
oligonucleotide designation is an abbreviation in lower case which indicates the type of oligonucleotide: h 
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= ail herpes viruses (Type 1); sq = additional sequencing tail available; g = gamma herpes viruses (Type 
1); f = RFHV/KSHV subfamily herpes viruses (Type 2); m = RFHVMm specific (Type 3); n = RFHVMn 
specific (T ype 3); ks = KSHV specific (Type 3). 

The phylogenetic analysis, the CpG analysis, and the similarity between symptoms caused by the 
5 three viruses support the use of the monkey viruses as models for KSHV, and other members of the 
RFHV/KSHV subfamily that may infect humans. 

Oligonucleotide primers were used in a screening assay to detect the presence of DNA polymerase 
encoding sequences in various biological samples. The results are shown in Figure 20. Results of RF 
samples of M. nemestrina monkeys #2, #3, #4, #7, #1 and #5 are shown in lanes A-D, I, and J 

10 respectively. Results from RF samples from a M. mulatta monkey is shown in lanes G & H. Results from 
peripheral blood lymphocytes of unaffected SRV2-negative M. nemestrina monkeys are shown in lanes E 
& F. Samples were assayed using nested PCR as follows: for the M. nemestrina samples, outer primers 
were VASGA and PEARB; inner primers were PEARB and PIEAB. For the M. mulatta samples, outer 
primers were FVEGA and KVIYB; inner primers were SPKDA and ASPDB. In this and other experiment, 

15 we found that the presence of amplification product correlates with the source of the samples in two ways: 
First, amplification was virus-specific (the RFHVMn specific oligonucleotides failed to amplify sequence 
from the M. mulatta RF lesion, but the RFHVMm specific oligonucleotides did. Second, M. nemestrina 
samples absent of RF-related symptoms did not yield reaction product, even when other viruses were 
present. A variety of tissues, including thymus, bone marrow, spleen, salivary gland, liver, mesenteric 

20 lymph node, ileocecal junction, duodenum, kidney and gonads naturally infected with SRV-2 were negative 
for the presence of RFHVMn sequences. 

Example 12: Other human-infecting gamma herpes DNA polymerase sequences of the 
RFHV/KSHV subfamily 

25 

Human tissue samples suspected of containing a previously undescribed gamma herpes virus, 
particularly fibroproliferative conditions, lymphocyte malignancies, and conditions associated with 
immunodeficiency and immunosuppression, such as acute respiratory disease syndrome (ARDS), are 
preserved by freezing, and the DNA is extracted as in Example 2. Two rounds of PCR amplification are 

30 conducted using the three herpes virus oligonucleotide primers, DFASA, VYGA and GDTD1B, according 
to Example 3. Alternatively, subfamily-specific (Type 2) primers may be used as described earlier in this 
example in the discovery of RFHV2. 

The amplified polynucleotide is electrophoresed in agarose and blotted onto a nylon membrane. The 
blot is hybridized with a probe comprising the polynucleotide fragment obtained from the RFHV 

35 polynucleotide encoding DNA polymerase (residues 330-501 of Figure 1), labeled with 32 P. The 
hybridization reaction is done under conditions that will permit a stable complex forming between the probe 
and DNA polymerase from a herpes virus, but not between the probe and endogenous eukaryotic DNA 
polymerase. The conditions will require approximately 60% identity between hybridizing segments of the 
probe and the target for a stable complex to form. These conditions are calculated using the formula given 

40 earlier, depending on the length and sequence of the probe and the corresponding sequence of the target. 
The conditions are estimated to be: a) allowing the probe to hybridize with the target in 6 x SSC (0.15 M 
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NaCI, 15 mM sodium citrate buffer) at room temperature in the absence of formamide; and b) washing 
newly formed duplexes for a brief period (5-10 min) in 2 x SSC at room temperature. 

Amplified polynucleotides that hybridize to the labeled probe under these conditions are selected for 
further characterization. The expected size is 236 base pairs for the amplified inner fragment including the 
primer-binding regions, for a virus that has no insertions or deletions relative to RFHV or KSHV, and has 
been amplified using VYGA and GDTD1B as inner primers. The sequence of the fragment is determined 
as in Example 4. Samples containing fragments different from RFHV or KSHV are selected for 
determination of the entire DNA polymerase gene sequence by a method similar to that in Example 9. 
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Protein 


This invention 






Herpes virus oligonucleotides 


DNA 


This invention 


; : : : ;: ; 13r1.jB;;:::;::. 




RFHV specific oligonucleotide 


DNA 


This invention 


| ;p1i^^:x:: 




KSHV specific oligonucleotide 


DNA 


This invention 






RFHV/KSHV subfamily 
oligonucleotide 


DNA 


This invention 


23 


eHV2 


DNA polymerase 


DNA 


Genbank locus EHVU20824 


24 


sHV1 


DNA polymerase 


DNA 


Genbank locus HSVSPOLGBP 




EBV 


DNA polymerase 


DNA 


Genbank locus EBV 




hCMV 


DNA polymerase 


DNA 


Genbank locus HS5POL 




hHV6 


DNA polymerase 


DNA 


Genbank locus HH6DNAPOL 




hVZV 


DNA polymerase 


DNA 


Genbank locus HEVZVXX 




hHSV1 


DNA polymerase 


DNA 


Genbank locus HEHSV1DP 




eHV2 


DNA polymerase 


Protein 


Genbank locus EHVU20824 


mmm. 


sHV1 


DNA polymerase 


Protein 


Genbank locus HSVSPOLGBP 




EBV 


DNA polymerase 


Protein 


Genbank locus EBV 




hCMV 


DNA polymerase 


Protein 


Genbank locus HS5POL 




hHV6 


DNA polymerase 


Protein 


Genbank locus HH6DNAPOL 




hVZV 


DNA polymerase 


Protein 


Genbank locus HEVZVXX 




hHSV1 


DNA polymerase 


Protein 


Genbank locus HEHSV1DP 




hHSV2 


DNA polymerase 


Protein 


PIR locus DJBE21 




eHV1 


DNA polymerase 


Protein 


PIR locus DJBEC3 




mCMV 


DNA polymerase 


Protein 


PIR locus DJBEMC 


40 


gpCMV 


DNA polymerase 


Protein 


PIR locus L25706-B 


7^41 ;t 


iHV1 


DNA polymerase 


Protein 


PIR locus DJBEI1 


42 • 


hHV6 


DNA polymerase segment 


DNA 


Figure 3 


43 


hCMV 


DNA polymerase segment 


DNA 


Figure 3 


44 


gpCMV 


DNA polymerase segment 


DNA 


Figure 3 


•::,.::45. 


mCMV 


DNA polymerase segment 


DNA 


Figure 3 


46 


hHSV1 


DNA polymerase segment 


DNA 


Figure 3 
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Isiill 1 

ilpllii 








^illi^ii 











hHSV2 

III IU V 4Ui 


DNA nnlvmpraQ^ <;pnmpnt 


DNA 


P in ii ret "\ 

rigure o 


■: :'A*:%v?W : : : : : : ; : 


hV7V 

1 1 VcV 


l-nnm polymerase segment 


DMA 
UNA 


rigure o 




eHV2 


DNA nnlvmpra^p Qpnmpnt 


DNA 


Pint iro *^ 




hEBV 


DNA Dolvmerase seament 


DNA 


Fioure 3 




sHV1 


DNA polymerase segment 


DNA 


Fiaure 3 




iHV1 


DNA polymerase segment 


DNA 


Figure 3 




hHV6 


DNA polymerase seament 


DNA 


Fiaure 4 


• : «54S; : : 


hCMV 


DNA Dolvmerase spament 


DNA 


Pjrjl 1TA 4 


. ,. : . : .:.;.>99:-: : : 


ypv_/ivi v 


una polymerase segment 


UNA 


rigure ** 




muMv 


DNA polymerase segment 


DNA 


Figure 4 


• .v. 57 


hHSV1 


DNA polymerase segment 


DNA 


Figure 4 




hVZV 


DNA polymerase segment 


DNA 


Figure 4 | 


:'x : :-* ; : : Crt 


eHV2 


DNA polymerase segment 


DNA 


Figure 4 


.v-f: ; ; DUx x : :.x 


urn\ / 

nbDv 


DNA polymerase segment 


UNA 


Figure 4 


:':v::::B1;:: 7: x ''"- 
: ,:>W:::P1:: 


e U\/1 

snv i 


una polymerase segment 


UNA 


rigure 4 


..xx^RO^x-xx 
x..DZx-x : xx 


iUI\/i 

In VI 


una polymerase segment 


PiKI A 

UNA 


rigure 4 


x:x-::p5*>|:;:;x: 


nnvD 


una polymerase segment 


UNA 


rigure o 




noivi v 


MM A F\ft*^l*t IWHAf AftA MiMh M J frV"|Mk.t' l %t 

una polymerase segment 


r»Ki a 

UNA 


ClA)| IPA £Z 

rigure o 


DO :, 


gpoivj v 


una polymerase segment 


UNA 


^T|MI| IpA C 

rigure o 


xixxfifixx:. 
•:x". DO 


mOMv 


uina polymerase segment 


r\M A 
UNA 


rigure o 


■ •/■xx-D. /•••■;' 


hUIO\/i 

nnovl 


una polymerase segment 


UNA 


rigure o 


: DO 


nvzv 


una polymerase segment 


r*\Ki a 
UNA 


rigure o 


vx: CQ . 


envz 


una polymerase segment 


PkKI A 

UNA 


rigure o 


iXJ ■ 


Snv i 


una polymerase segment 


HMA 
UNA 


rigure o 


•v xTI 

x / I .: x-; • 


uco\/ 

ntcsv 


una polymerase segment 


HMA 
UNA 


rigure o 




!UI\/i 


una polymerase segment 


HMA 
UNA 


lT 2 M ■ ■ ft»^X C 

rigure o 


73 


IAETVTL 


gamma-herpes antigen 


Protein 


This invention 


74-79 




RFHV/KSHV subfamily antigens 


Protein 


This invention 


80-103 




RFHV or KSHV specific antigens 


Protein 


This invention 


104 


SIIQB 


KSHV specific oligonucleotide 


DNA 


This invention 




QAHNA 


Gamma herpesoligonucleotide 


DNA 


This invention 


106 


QAHNB 


Gamma herpesoligonucleotide 


DNA 


This invention 


109 




RFHV/KSHV subfamily specific 
oligonucleotides 


DNA 


This invention 


110 


RFHV 
fragment 


Shared polynucleotide sequence 


DNA 


This invention 
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Iflsiiii 

liiiii 


lll||i|i!SliPl 


!l!!!!lifB 










Dp|_|\/ 

r\rrl V 

fragment 


onarea polynucleotide sequence 


UNA 


This invention 


112 


RFHV 
fragment 


Shared polypeptide sequence 


Protein 


This invention 




KSHV 

fr?inm*ant 

1 1 cay 1 1 id ii 


Shared polypeptide sequence 


Protein 


This invention 


;: : ::: : : : : : : : : : l.;:f.^r : ::: : : : >: : : 


fxonv 

fragment 


rxonv iragment 


UNA 


i nis invention 


lllllll 


KSHV 
fragment 


KSHV fragment 


Protein 


This invention 




KSHV 


DNA polymerase segment 


DNA 


This invention 




KSHV 


DNA polymerase segment 


Protein 


This invention 




RFHV2 


DNA polymerase PCR segment 


DNA 


This invention 




RFHV2 


DNA polymerase PCR segment 


Protein 


This invention 


;x:::;:y::I^U:Vx;:;.:. 




una polymerase vanant 


Dent /-tin 

Protein 


This invention 




KSHV 


DNA polymerase variant 


Protein 


This invention 




KSHV 


DNA polymerase variant 


Protein 


This invention 


:x::^123:i:- : ;; 


KSHV 


DNA polymerase variant 


Protein 


This invention 






Herpes virus (Type 1) 
oligonucleotides 


DNA 


This invention, Table 1 0 






KSHV specific oligonucleotides 


DNA 


This invention, Table 1 1 


i5om 




RFHV2 specific oligonucleotides 


DNA 


This invention, Table 14 



= 



SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GTGTTCGACT TTGCTAGTCT TTACCCCAGC ATCATGCAGG CACACAACCT CTGTTATTCT 60 

ACCCTGATTA CAGGAAGCGC CCTACACGGG CACCCCGAAC TGACCCCCGA CGACTACGAA 120 

5 ACCTTCCACC TGAGCGGGGG AACGGTACAC TTTGTAAAAA AGCACGTCCG CGAGTCACTA 180 

CTGTCCAAAC TGCTCACAAC ATGGCTGGCC AAGAGGAAAG AGATCCGCAA AAATTTAGCC 240 

TCGTGCACAG ACCCCACCAT GCGCACCATA CTGGATAAAC AACAGCTGGC CATCAAGGTC 300 

ACATGTAACG CGGTGTACGG GTTCACCGGC GTCGCTTCCG GCATCCTACC GTGCCTGAAC 360 

ATCGCAGAGA CGGTGACCCT CCAGGGCAGG AAAATGCTGG AAACGTCTCA GGCGTTCGTA 420 

1 0 GAGGGAATCT CGCCAACGGC ACTGGCAGAC CTACTGCAGC GACCGATCGA GGCGTCTCCG 480 

GAAGCCAGGT TTAAAGTGAT ATACGGCGAC ACCGACTCCG TGTTTGTCGC ATGCCG 536 

SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Val Phe Asp Phe Ala Ser Leu Tyr Pro Ser lie Met Gin Ala His Asn 

15 Leu Cys Tyr Ser Thr Leu He Thr Gly Ser Ala Leu His Gly His Pro 

Glu Leu Thr Pro Asp Asp Tyr Glu Thr Phe His Leu Ser Gly Gly Thr 
Val His Phe Val Lys Lys His Val Arg Glu Ser Leu Leu Ser Lys Leu 
Leu Thr Thr Trp Leu Ala Lys Arg Lys Glu He Arg Lys Asn Leu Ala 
Ser Cys Thr Asp Pro Thr Met Arg Thr He Leu Asp Lys Gin Gin Leu 

20 Ala He Lys Val Thr Cys Asn Ala Val Tyr Gly Phe Thr Gly Val Ala 

Ser Gly He Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr Leu Gin 
Gly Arg Lys Met Leu Glu Thr Ser Gin Ala Phe Val Glu Gly He Ser 



-78- 



WO 97/04105 



PCT/US96/11688 



Pro Thr Ala Leu Ala Asp Leu Leu Gin Arg Pro He Glu Ala Ser Pro 
Glu Ala Arg Phe Lys Val He Tyr Gly Asp Thr Asp Ser Val Phe Val 
Ala Cys 



10 



15 



20 



25 



SEQUENCE DESCRIPTION: 
GTGTTCGACT TTGCTAGCCT 
ACACTGATAC CCGGCGATTC 
ACCTTTGTCC TCAGCGGAGG 
CTTACCAAGC 7TCTGACGGT 
TCATGCACGG ACCCCGCACT 
ACCTGCAACG CGGTTTACGG 
ATAGCGGAGA CCGTGACACT 
GAGGCCATCT CGCCGGAACG 
GACGCCCGAT TCAAGGTCAT 



SEQ ID N0:3: 
CTACCCCAGT ATCATCCAAG 
GCTCCACCTG CACCCACACC 
TCCGGTCCAC TTTGTAAAAA 
ATGGCTCGCG AAGAGAAAAG 
GAAAACTATT CTAGACAAAC 
CTTCACGGGC GTTGCCTCTG 
ACAAGGGCGA AAGATGCTGG 
CCTAGCGGGT CTCCTGCGGA 
ATACGGCGAC ACCGACTCCG 



CGCACAACTT 
TCTCCCCGGA 
AACACAAAAG 
AAATAAGAAA 
AACAACTGGC 
GCATACTGCC 
AGAGATCTCA 
GGCCAATAGA 
TGTTTGTCGC 



GTGCTACTCC 60 

CGACTACGAA 120 

GGAGTCCCTT 180 

GACCCTGGCA 240 

CATCAAGGTT 300 

TTGCCTAAAC 360 

GGCCTTTGTA 420 

CGTCTCACCC 480 

ATGCCG 536 



SEQUENCE 
Val 
Leu 
His 
Val 
Leu 
Ser 
Ala 
Ser 
Gly 
Pro 
Asp 
Ala 



30 



DESCRIPTION 
Phe Asp Phe 
Cys Tyr Ser 
Leu Ser Pro 
His Phe Val 
Thr Val Trp 
Cys Thr Asp 
He Lys Val 
Gly He Leu 
Arg Lys Met 
Glu Arg Leu 
Ala Arg Phe 
Cys 



: SEQ ID 

Ala Ser 
Thr Leu 
Asp Asp 
Lys Lys 
Leu Ala 
Pro Ala 
Thr Cys 
Pro Cys 
Leu Glu 
Ala Gly 
Lys Val 



N0:4: 
Leu Tyr 
He Pro 
Tyr Glu 
His Lys 
Lys Arg 
Leu Lys 
Asn Ala 
Leu Asn 
Arg Ser 
Leu Leu 
He Tyr 



Pro Ser He 
Gly Asp Ser 
Thr Phe Val 
Arg Glu Ser 
Lys Glu He 
Thr He Leu 
Val Tyr Gly 
He Ala Glu 
Gin Ala Phe 
Arg Arg Pro 
Gly Asp Thr 



He Gin 
Leu His 
Leu Ser 
Leu Leu 
Arg Lys 
Asp Lys 
Phe Thr 
Thr Val 
Val Glu 
He Asp 
Asp Ser 



Ala His Asn 
Leu His Pro 
Gly Gly Pro 
Thr Lys Leu 
Thr Leu Ala 
Gin Gin Leu 
Gly Val Ala 
Thr Leu Gin 
Ala He Ser 
Val Ser Pro 
Val Phe Val 



SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GTGTTCGACT TYGCNAGYYT NTAYCC 

35 SEQUENCE DESCRIPTION: SEQ ID N0:6: 
GTGTTCGACT TYCARAGYYT NTAYCC 

SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACGTGCAACG CGGTGTAYGG NKTNACNGG 

40 

SEQUENCE DESCRIPTION: SEQ ID N0:8: 
ACGTGCAACG CGGTGTACGG SGTSACSGG 

SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
45 ACGTGCAACG CGGTGTA 

SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TAYGGNGAYA CNGACTCCGT GTTTGTCGCA TGCCG 

50 SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGGCATGCGA CAAACACGGA GTCNGTRTCN CCRTA 

SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGGCATGCGA CAAACACGGA 

55 

SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGTCGCTTCC GGCATCCTAC C 

SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
60 GGCATCCTAC CGTGCCTGAA C 

SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCGGAGACGC CTCGATCGGT C 



26 



26 



29 



29 



17 



35 



35 



20 



21 



21 



21 
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SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

AACCTGGCTT CCGGAGACGC C 21 

SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
5 GCGTTGCCTC TGGCATACTG 20 

SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTGCCTTGCC TAAACATAGC G 21 

10 SEQUENCE DESCRIPTION: SEQ ID N0:19: 

GGTGAGACGT CTATTGGCCT 20 



15 



SEQUENCE DESCRIPTION: SEQ ID N0:20: 

AATCGGGCGT CGGGTGAGAC G 21 
SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTCGCCTCTG GCATCCTNCC NTGYCTNAA 29 



SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

20 CAGGGCCGGA AGATGCTGGA RACRTCNCAR GC 32 

SEQUENCE DESCRIPTION: SEQ ID N0:23: 

ATGAGTTTCT ACAACCCCTA CTTGGTCAAG AGGACCTTTC TTAAAAAGGC CGCCCCCTCG 60 

CGGCCGACCA AGGAATATAC CAGGATAATT CCAAAATGCT TCAAGACCCC AGGCGCCGCG 120 

25 GGGGTGGTGC CCCACACCAG CACCCTGGAC CCGGTGTGCT TCGTGGGGGA CAAGGAGACC 180 

CCCATCCTGT ACGGGGACGG GAGCAGGAGC CTGTGGAGCG CGGGTGGGCG GGGCGGGCCG 240 

GGGACGGGCG CGGGCCAGGG CCACACGCCT GTGGCCCTGA CCTTCCACGT CTATGACATA 300 

ATAGAGACGG TGTACGGCCA GGACAGGTGC GACCACGTGC CCTTTCAGTT TCAGACGGAC 360 

ATCATCCCCA GCGGGACGGT CCTCAAGCTG CTGGGTCGCA CCTCGGACGA CCGCAGCGTG 420 

30 TGCGTGAACG TGTTCAGGCA GGAGCTGTAC TTTTACGTGC GCGTGCCCGA GGGGCTCAAG 480 

CTGGACTTTC TCATCCAGCA GTGCTCGCGG GAGAACTTTA ACTTTAGCCA GGGCCGGTAC 540 

CGATATGAGA AAACAAGCAA GCGCGTGTTG CGCGAGTACT GCGTCGAGGC GCGGGAGGTG 600 

TACCGGGTGT TCGCGTCGAG CCAGGGGTTC GTGGACCTCC TGGCCGGGGG GCTCACGGCC 660 

GCGGGGTGCG AGGTCTTCGA GACAAACGTG GACGCGGCCA GGCGGTTCAT CATAGACAAC 720 

35 GGGTTCTCCA CCTTCGGGTG GTACTCGTGC GCGGCGGCCG TCCCGCGCCA GGGGGGCGCG 780 

GCCAGGGACT CCTGGACGGA GTTGGAGTAC GACTGCGCCG CGGGGGACCT GGAGTTTCAC 840 

GCGGGGCGGG CGGACTGGCC GGGCTACAAC GTCCTCTCCT TCGATATAGA GTGCCTGGGG 900 

GAGAACGGGT TCCCCAACGC GAGCAGGGAC GAGGACATGA TCCTGCAGAT CTCCTGCGTG 960 

ATCTGGAAGG CGGGGTCGGG GGAGGCGCCC AGGAGCGTGC TCCTGAACCT GGGCACGTGC 1020 

40 GAGGAGATAG AGGGGGTGGA GGTGTACCAG TGCCCCTCGG AGCTGGACCT GCTCTACCTC 1080 

TTTTTCACCA TGATCAGGGA CGCGGACGTG GAGTTTGTGA CGGGCTACAA CATCTCCAAC 1140 

TTTGACTTCC CCTACGTGAT AGACAGGGCC ACGCAGGTGT ACAACCTGAA CCTGAAAGAG 1200 

TTCACCCGGG TGCGCTCCTC GTCCATCTTC GAGGTGCACA AGCCCAAGAA CAGCTCAGCG 1260 

GGCTTCATGC GCGCGGTGTC CAAGGTCAAG GTGGCCGGGG TGGTGCCCAT AGACATGTAC 1320 

45 CAGGTGTGCA GGGACAAGCT GAGCCTGTCC AACTACAAGC TGGACACGGT GGCCGGGGAG 1380 

TGCGTGGGCG CCAAGAAGGA GGACGTCTCC TACAAGGAGA TCCCCCACCT GTTCAGGCAG 1440 

GGACCGGGGG GCAGGGCCAG GCTGGGGCTG TACTGCGTCA AGGATTCCGC CCTGGTGCTG 1500 

GACCTGCTGA GGTACTTTAT GACGCACGTG GAGATCTCTG AGATAGCCAA GATAGCCAAG 1560 

ATCCCCACGC GGCGGGTGCT CACGGACGGG CAGCAGATCA GGGTCTTCTC CTGCCTGCTG 1620 

50 GACGTGGCCG GGCGGGAGGG CTACATCCTG CCAGTGGACA GGCACGCGGA CGCGGAGGGC 1680 

TACCAGGGGG CCACGGTCAT AGACCCCTCG CCCGGGTTCT ACAACACCCC GGTGCTGGTG 1740 

GTGGACTTTG CCAGCCTGTA CCCCACCATC ATCCAGGCCC ACAACCTCTG CTACTCCACC 1800 

ATGATCCCCG GAGACAGGCT GTGCCTGCAC CCGCACCTCG GGCCGGGCGA CTACGAGACC 1860 

TTTGAGCTCG CGAGCGGGCC GGTGCACTTT GTCAAGAAGC ACAAGGCGGT CTCGCTGCTG 1920 

55 GCCACGCTGC TGAACGTGTG GCTGGCCAAG AGGAAGGCCA TCAGGCGCGA GCTGGCCACG 1980 

GTCTCGGACG AGGCCGTCAG GACCATCCTG GACAAGCAGC AGCTGGCCAT CAAGGTCACC 2040 

TGCAACGCGG TGTACGGGTT CACGGGCGTG GCCTCGGGCA TCCTGCCCTG TCTCAAGATA 2100 

GCCGAGACGG TCACCTTCCA GGGCAGGCGC ATGCTGGAGA ACTCCAAGCG CTACATAGAG 2160 

GGGGTGACCC CCGAGGGGCT GGCAGACATA TTGGGCAGGC GGGTGGAGTG CGCCCCCGAT 2220 

60 GCCAGTTTTA AGGTCATCTA CGGGGACACG GACTCCCTGT TTATCCACTG CCGGGGCTAC 2280 

CGCCCAGAGC AGGTCACGGG GTTCTGCGAC GAGCTGGCCG CTCACATGAC CCGAACCCTG 2340 

TTCGTGGACC CCATCAAGCT GGAGGCCGAA AAGACCTTCA AGTGCCTGAT CTTACTGACC 2400 

AAAAAGAGGT ACATAGGCAT GATGACCACC GACAGGCTGC TCATGAAGGG GGTGGACCTG 2460 

GTGCGCAAGA CGGCGTGCAG GTTCGTGCAG GAGACCACCA AGGCCATCCT GGACCTGGTG 2520 
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ATGGGGGACG AGGCGGTGCG GGCGGCGGCC GAGCGCCTGT GCGCCATGAG GGTGGAGGAG 2580 

GTGTGCGCGC GGGGGCCCCC CGTCGGGTTC CTCAAGGTGG TGGACATCCT CAACGACAGC 2640 

TACAGGAAAC TAAGGCTCAA CCGGGTGCCC GTGGGCCAGC TGTCCTTCTC CACCGAGCTG 2700 

AGCAGGCCCA TCTCCTATTA CAAGACCCTG ACCCTGCCCC ACCTGGTGGT GTACCACAAG 2760 

5 ATCATGCAGA GGAACGAGGA GCTCCCCCAG ATCCACGATA GGATAGCCTA CGTGTTTGTG 2820 

CAGTCCCCCA AGGGGAAGCT GAGGTCCGAG ATGGCCGAGG ACCCCGCCTA CGCGGCCCAG 2880 

CACAACATCC CCCCGGCCGT GGACCTGTAC TTTGACAAGG TCATACACGG GGCGGCCAAC 2940 

ATCCTGCAGT GCCTGTTTGA GAACGACAGC GATAAGGCCG CGAGGGTGCT GTACAACTTT 3000 

GCGGACCTGC CCCCCGACGA CCTGTGA 3027 

10 

SEQUE NCE D ESCRIPTION: SEQ ID NO: 24: 

ATGGATTTTT ACAACCCATA TCTAAGTAAA AAGCCAACAG ATACAAAGAC ACCTAAGCTT 60 

CATACAACTA GACAATCTAT ATGTAGGTTA GTCCCTAAAT GTTTTAGAAA TCCTACTGAA 120 

AAAGGCGTAG TGTCTGTGTC TTCTTTTGCT CTTCCAACTT ACTTTTTCAA AGGTAATGAG 180 

1 5 AATAAAGTAT ATCTTGAAAA TGGTAAGTCT ATGTGGCACT TAAGAAGACC GTGTAAGAAC 240 

GCGTTGCTAG AAGAACAATC TATTACGTTC CATATTTATG ACATAGTAGA AACTACTTAT 300 

TCAGAAGACA GATGTAACGA TATTCCTTTT AAGTTTCAAA CAGACATAAT ACCTAATGGA 360 

ACAGTGTTGA AAC TACTTGG AAGAACACTA GAGGGTGCGA GCGTATGTGT TAACGTGTTT 420 

GGACAAAGAA ATTATTTTTA TGTTAAAGTT CCGGAAGGTG GCAACATAAC CTATCTTATA 480 

20 AAACAAGCTT TGAATGAAAA ATTTAGCCCA TCTTGTGCAT ACCAAACTGA AGCAGTAAAG 540 

AAGAAGATAC TATCTAGATA TGATCCAGAA GAACATGATG TGTTTAAGGT GACAGTGTCT 600 

TCTTCCCTTT CTGTTTATAA GATATCAGAT TCTTTAGTGT CTAATGGTTG TGAAGTTTTT 660 

GAAACAAATG TAGATGCTAT AAGAAGATTT GTAATTGATA ATGACTTTTC TACATTTGGT 720 

TGGTACACAT GTAAGTCTGC ATGTCCTCGA ATCACAAATA GAGACTCTCA TACTGACATT 780 

25 GAGTTTGACT GCGGGTACTA TGACTTAGAA TTTCATGCTG ATAGAACAGA ATGGCCACCT 840 

TACAACATAA TGTCTTTTGA TATAGAATGT ATAGGAGAAA AAGGATTTCC TTGTGCAAAA 900 

AATGAAGGAG ATTTAATAAT TCAGATTTCA TGTGTGTTTT GGCACGCTGG GGCGCTTGAT 960 

ACAACTAGAA ATATGCTATT ATCTTTAGGA ACGTGCTCAG CTGTTGAAAA TACTGAAGTT 1020 

TATGAGTTCC CTAGTGAAAT AGACATGCTG CATGGGTTTT TTTCATTAAT TAGAGACTTT 1080 

30 AATGTTGAAA TAATTACTGG TTATAATATT TCTAACTTTG ACTTACCTTA TCTAATTGAT 1140 

AGAGCTA CTC AAATTTATAA TATAAAGCTA TCTGATTATT CAAGAGTTAA AACAGGGTCT 1200 

ATTTTTCAAG TTCATACACC AAAAGATACA GGAAATGGGT TTATGAGATC TGTCTCTAAA 1260 

ATAAAAATTT CAGGAATTAT AGCAATTGAC ATGTACATTG TGTGCAAAGA CAAACTCAGT 1320 

CTGTCTAATT ACAAGCTTGA TACA GTTGCT AATCACTGTA TTGGTGCAAA AAAGGAAGAT 1380 

35 GTGTCTTACA AAGATATTAT GCCTCTTTTT ATGTCCGGAC CAGAAGGCAG AGCTAAGATA 1440 

GGACTATACT GTGTAATAGA TTCTGTTCTT GTGATGAAAC TTTTGAAATT TTTTATGATT 1500 

CATGTTGAAA TTTCTGAGAT AGCAA AACTC GCTAAAATCC CCACAAGAAG AGTTCTTACA 1560 

GATGGGCAAC AAATAAGAGT TTTTTCTTGT CTGCTTGCAG CAGCTCGTGC AGAAAACTAT 1620 

ATACTGCCTG TGTCAAATGA TGTCAATGCG GATGGGTTTC AAGGAGCTAC CGTTATAAAT 1680 

40 CCAATTCCTG GATTTTATAA CAATGCTGTA TTAGTAGTAG ACTTTGCTAG CCTGTATCCT 1740 

AGTATTATAC AAGCTCATAA TCTATGCTAC TCCACTCTTA TACCCCACCA TGCTTTACAC 1800 

AACTACCCTC ACTTAAAATC TAGTGACTAT GAGACTTTCA TGCTCAGTTC TGGACCTATA 1860 

CACTTTGTGA AAAAACACAT TCAGGCATCT CTTCTATCTA GGCTCTTAAC TGTGTGGCTT 1920 

TCTAAGAGAA AAGCTATTAG GCAAAAGCTT GCTGAATGTG AAGACCTAGA CACTAAAACT 1980 

45 ATTCTAGATA AACAGCAACT CGCTATTAAA GTMCTTGTA ATGCTGTGTA TGGGTTTACA 2040 

GGAGTTGCGT CAGGCTTGCT GCCATGCATA AGCATTGCAG AGACTGTTAC TCTCCAAGGC 2100 

CGGACGATGC TAGAAAAATC AAAAATATTC ATAGAAGCAA TGACACCTGA TACACTTCAA 2160 

GAAATTGTTC CTCATATAGT GAAGCATGAA CCTGATGCGA AGTTCAGAGT CATATATGGA 2220 

GACACAGACT CTCTATTTGT AGAATGTGTT GGGTATTCTG TAGACACAGT TGTTAAATTT 2280 

50 GGAGATTTCT TAGCTGCTTT TACTTCTGAA AAGCTCTTTA ATGCTCCTAT AAAGTTAGAG 2340 

TCAGAAAAAA CATTTCAGTG TTTGCTATTG CTTGCTAAAA AAAGATACAT TGGAATACTG 2400 

TCAAATGACA AATTGCTTAT GAAAGGTGTT GACTTAGTGA GAAAAACTGC TTGTAAATTT 2460 

GTTCAAAATA CTAGCTCAAA AATTCTTAAT CTTATACTTA AAGACCCTGA GGTAAAAGCA 2520 

GCT GCTCA GC mTGTCAAC AAAAGATCCA GACTATGCTT TTAGAGAAGG GCTTCCTGAT 2580 

55 GGGTTTTTGA AAGTGATAGA CATTTTAAAT GAAAGCCACA AAAACCTCAG AACTGGGCAA 2640 

GTGCCGGTAG AGGAATTAAC ATTTTCTACA GAATTGAGTA GACCTATTTC TTCTTACAAA 2700 

ACTGAAAACT TGCCTCATTT AACTGTTTAT AAAAAAATTA TTACAAGGCA TGAAGAACCT 2760 

CCACAAGTTC ATGACAGAAT CCCATACGTT TTTGTAGGCA AGACTACATC ATGCATATCA 2820 

AACATGGCTG AAGACCCAAC ATACACGGTT CAAAATAATA TTCCAATTGC AGTGGATCTA 2880 

60 TATTTTGATA AACTTATTCA CGGGGTAGCT AACATAATAC AGTGTCTCTT TAAAGACAGC 2940 

AGTAAAACTG TGTCTGTTTT GTATAATTTT GTATCAACTC CTGTTTTATT TTCTTACGAG 3000 

CTTCTAACTG ATCATTCTGT AAAAGCATAA 3030 
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SEQUENCE DESCRIPTION: SEQ ID N0:25: 

ATGTCTGGGG GACTCTTCTA TAACCCTTTC CTAAGACCTA ATAAAGGCCT TCTGAAAAAG 60 

CCTGACAAGG AGTACCTGCG TCTCATTCCC AAGTGTTTCC AGACACCAGG CGCCGCAGGG 120 

5 GTGGTGGATG TGCGGGGGCC TCAGCCCCCC CTGTGCTTCT ACCAAGACTC CCTGACGGTG 180 

GTGGGGGGTG ACGAGGATGG AAAGGGCATG TGGTGGCGCC AGCGTGCCCA AGAGGGCACG 240 

GCAAGGCCGG AGGCAGACAC CCACGGAAGC CCTCTGGACT TCCATGTCTA CGACATACTC 300 

GAGACGGTGT ACACGCACGA GAAATGCGCC GTCATTCCAT CGGATAAACA GGGGTATGTG 360 

GTGCCATGTG GCATCGTCAT CAAGCTACTG GGCCGGCGCA AGGCCGATGG GGCCAGCGTG 420 

1 0 TGTGTGAACG TGTTTGGGCA GCAGGCCTAC TTCTACGCCA GCGCGCCTCA GGGTCTGGAC 480 

GTGGAGTTTG CAGTCCTCAG CGCCCTCAAG GCCAGCACCT TCGACCGCAG GACCCCCTGC 540 

CGGGTCTCGG TGGAGAAGGT CACGCGCCGT TCCATTATGG GCTACGGCAA CCATGCCGGC 600 

GACTACCACA AGATCACCCT CTCCCATCCC AACAGTGTGT GTCACGTGGC CACGTGGCTG 660 

CAAGACAAGC ACGGGTGTCG GATCTTTGAG GCCAACGTGG ATGCCACGCG CCGCTTTGTC 720 

15 CTGGACAATG ACTTTGTCAC CTTTGGCTGG TACAGCTGCC GCCGCGCCAT CCCCCGCCTC 780 

CAGCACCGGG ACTCGTACGC CGAGCTCGAG TACGACTGTG AGGTGGGCGA CCTCTCGGTC 840 

CGGCGTGAAG ACAGCTCCTG GCCCTCCTAC CAGGCCCTGG CCTTCGATAT CGAGTGTCTG 900 

GGGGAGGAGG GCTTCCCCAC GGCCACCAAC GAGGCTGACC TGATCCTGCA GATATCCTGC 960 

GTCCTCTGGT CGACAGGGGA GGAGGCCGGG CGCTATAGGC GCATCCTGCT GACGCTGGGC 1020 

20 ACCTGCGAAG ACATAGAGGG GGTTGAGGTC TACGAGTTCC CATCGGAGCT GGACATGCTC 1080 

TACGCCTTCT TCCAGCTCAT CAGAGACCTC AGCGTGGAGA TTGTGACCGG CTACAACGTG 1140 

GCCAACTTTG ACTGGCCCTA CATTCTGGAC AGAGCCAGGC ACATCTACAG CATCAACCCA 1200 

GCCTCTCTGG GCAAAATTAG GGCTGGGGGC GTCTGCGAGG TCAGGCGACC CCATGATGCG 1260 

GGCAAGGGCT TCTTGCGGGC CAACACCAAG GTCCGCATCA CCGGCCTCAT CCCCATCGAC 1320 

25 ATGTACGCCG TGTGCCGGGA CAAGCTCAGC CTCTCAGACT ACAAGCTGGA CACAGTAGCC 1380 

AGGCACCTAC TGGGGGCCAA GAAGGAGGAT GTGCATTACA AGGAGATTCC TCGCCTCTTT 1440 

GCAGCGGGCC CCGAGGGGCG CAGGCGGCTC GGCATGTACT GCGTGCAGGA CTCGGCCCTG 1500 

GTCATGGATC TGCTAAACCA TTTCGTGATC CACGTGGAGG TGGCAGAGAT TGCCAAGATC 1560 

GCTCACATCC CCTGCAGGCG GGTGCTGGAC GATGGGCAGC AGATCCGCGT GTTCTCCTGC 1620 

30 CTCCTGGCGG CCGCCCAAAA GGAAAACTTT ATCCTGCCCA TGCCCTCGGC CTCTGACCGG 1680 

GACGGCTACC AGGGGGCCAC CGTCATCCAG CCCCTGTCCG GATTCTACAA CTCCCCGGTT 1740 

CTGGTGGTGG ACTTTGCCAG CCTCTACCCG AGCATCATTC AGGCTCATAA TCTCTGTTAT 1800 

TCTACCATGA TAACGCCGGG AGAAGAGCAC AGGCTAGCCG GCCTGCGCCC GGGAGAAGAC 1860 

TATGAGTCCT TCAGGCTCAC GGGGGGCGTC TACCACTTTG TAAAGAAGCA CGTGCACGAG 1920 

35 TCCTTCTTGG CTAGTCTGTT GACCTCCTGG CTGGCCAAGC GCAAGGCCAT CAAGAAGCTG 1980 

CTGGCGGCCT GCGAGGATCC GCGCCAAAGG ACCATCCTCG ACAAGCAGCA GCTGGCCATC 2040 

AAGTGCACGT GCAACGCCGT CTACGGCTTC ACCGGGGTGG CCAACGGCCT CTTTCCCTGC 2100 

CTCTCCATCG CCGAGACGGT GACGCTGCAG GGCCGCACGA TGTTGGAGCG GGCCAAGGCC 2160 

TTCGTGGAGG CCCTGAGCCC CGCCAACCTG CAGGCCCTGG CCCCCTCCCC GGACGCCTGG 2220 

40 GCGCCCCTCA ACCCCGAGGG CCAGCTTCGA GTCATCTACG GGGACACGGA CTCGCTGTTT 2280 

ATCGAGTGCC GGGGGTTTTC AGAGAGCGAG ACCCTGCGCT TTGCCGATGC CCTGGCCGCC 2340 

CACACCACCC GGAGCCTGTT TGTGGCCCCC ATCTCCCTGG AGGCCGAGAA GACCTTCTCC 2400 

TGCCTGATGC TGATTACAAA GAAGAGATAT GTGGGGGTGC TGACGGACGG CAAGACCCTG 2460 

ATGAAGGGGG TGGAGCTCGT CCGGAAGACG GCCTGCAAGT TTGTGCAGAC ACGCTGCCGG 2520 

45 CGCGTGCTCG ACCTGGTGCT GGCGGATGCC CGGGTAAAGG AGGCGGCCAG CCTCCTCTCC 2580 

CACCGGCCCT TCCAAGAGTC ATTTACACAA GGGCTACCTG TGGGCTTTTT GCCCGTCATT 2640 

GACATCCTAA ACCAGGCCTA CACAGACCTC CGTGAAGGCA GGGTCCCCAT GGGGGAGCTC 2700 

TGCTTTTCM CGGAGCTCAG CCGCAAGCTC TCAGCCTACA AGAGCACCCA GATGCCTCAC 2760 

CTGGCCGTCT ACCAGAAGTT CGTCGAGCGC AACGAGGAAC TGCCCCAGAT CCACGACCGC 2820 

50 ATCCAGTACG TCTTTGTGGA GCCCAAGGGG GGAGTGAAGG GGGCGAGAAA GACGGAGATG 2880 

GCCGAGGACC CGGCCTACGC CGAGCGGCAC GGCGTTCCCG TGGCCGTGGA TCATTATTTC 2940 

GACAAGCTGC TCCAAGGAGC GGCCAACATC CTCCAGTGCC TCTTTGATAA CAACTCCGGG 3000 

GCCGCCCTCT CCGTCCTCCA GAATTTTACA GCCCGGCCAC CATTCTAA 3048 

55 SEQ UENCE DESCRIPTION: SEQ ID NO: 26: 

ATGTmTCA ACCCGTATCT GAGCGGCGGC GTGACCGGCG GTGCGGTCGC GGGTGGCCGG 60 

CGTCAGCGTT CGCAGCCCGG CTCCGCGCAG GGCTCGGGCA AGCGGCCGCC ACAGAAACAG 120 

TTTTTGCAGA TCGTGCCGCG AGGTGTCATG TTCGACGGTC AGACGGGGTT GATCAAGCAT 180 

AAGACGGGAC GGCTGCCTCT CATGTTCTAT CGAGAGATTA AACATTTGTT GAGTCATGAC 240 

60 ATGGTTTGGC CGTGTCCTTG GCGCGAGACC CTGGTGGGTC GCGTGGTGGG ACCTATTCGT 300 

TTTCACACCT ACGATCAGAC GGACGCCGTG CTCTTCTTCG ACTCGCCCGA AAACGTGTCG 360 

CCGCGCTATC GTCAGCATCT GGTGCCTTCG GGGAACGTGT TGCGTTTCTT CGGGGCCACA 420 

GAACACGGCT ACAGTATCTG CGTCAACGTT TTCGGGCAGC GCAGCTACTT TTACTGTGAG 480 

TACAGCGACA CCGATAGGCT GCGTGAGGTC ATTGCCAGCG TGGGCGAACT AGTGCCCGAA 540 
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CCGCGGACGC CATACGCCGT GTCTGTCACG CCGGCCACCA AGACCTCCAT CTATGGGTAC 600 

GGGACGCGAC CCGTGCCCGA TTTGCAGTGT GTGTCTATCA GCAACTGGAC CATGGCCAGA 660 

AAAATCGGCG AGTATCTGCT GGAGCAGGGT TTTCCCGTGT ACGAGGTCCG TGTGGATCCG 720 

CTGACGCGTT TGGTCATCGA TCGGCGGATC ACCACGTTCG GCTGGTGCTC CGTGAATCGT 780 

5 TACGACTGGC GGCAGCAGGG TCGCGCGTCG ACTTGTGATA TCGAGGTAGA CTGCGATGTC 840 

TCTGACCTGG TGGCTGTGCC CGACGACAGC TCGTGGCCGC GCTATCGATG CCTGTCCTTC 900 

GATATCGAGT GCATGAGCGG CGAGGGTGGT T7TCCCTGCG CCGAGAAGTC CGATGACATT 960 

GTCATTCAGA TCTCGTGCGT GTGCTACGAG ACGGGGGGAA ACACCGCCGT GGATCAGGGG 1020 

ATCCCAAACG GGAACGATGG TCGGGGCTGC ACTTCGGAGG GTGTGATCTT TGGGCACTCG 1080 

10 GGTCTTCATC TCTTTACGAT CGGCACCTGC GGGCAGGTGG GCCCAGACGT GGACGTCTAC 1140 

GAGTTCCCTT CCGAATACGA GCTGCTGCTG GGCTTTATGC TTTTCTTTCA ACGGTACGCG 1200 

CCGGCCTTTG TGACCGGTTA CAACATCAAC TCTTTTGACT TGAAGTACAT CCTCACGCGT 1260 

CTCGAGTACC TGTATAAGGT GGACTCGCAG CGCTTCTGCA AGTTGCCTAC GGCGCAGGGC 1320 

GGCCGTTTCT TTTTACACAG CCCCGCCGTG GGTTTTAAGC GGCAGTACGC CGCCGCTTTT 1380 

1 5 CCCTCGGCTT CTCACAACAA TCCGGCCAGC ACGGCCGCCA CCAAGGTGTA TATTGCGGGT 1440 

TCGGTGGTTA TCGACATGTA CCCTGTATGC ATGGCCAAGA CTAACTCGCC CAACTATAAG 1500 

CTCAACACTA TGGCCGAGCT TTACCTGCGG CAACGCAAGG ATGACCTGTC TTACAAGGAC 1560 

ATCCCGCGTT GTTTCGTGGC TAATGCCGAG GGCCGCGCCC AGGTAGGCCG TTACTGTCTG 1620 

CAGGACGCCG TATTGGTGCG CGATCTGTTC AACACCATTA ATTTTCACTA CGAGGCCGGG 1680 

20 GCCATCGCGC GGCTGGCTAA AATTCCGTTG CGGCGTGTCA TCTTTGACGG ACAGCAGATC 1740 

CGTATCTACA CCTCGCTGCT GGACGAGTGC GCCTGCCGCG ATTTTATCCT GCCCAACCAC 1800 

TACAGCAAAG GTACGACGGT GCCCGAAACG AATAGCGTTG CTGTGTCACC TAACGCTGCT 1860 

ATCATCTCTA CCGCCGCTGT GCCCGGCGAC GCGGGTTCTG TGGCGGCTAT GTTTCAGATG 1920 

TCGCCGCCCT TGCAATCTGC GCCGTCCAGT CAGGACGGCG TTTCACCCGG CTCCGGCAGT 1980 

25 AACAGTAGTA GCAGCGTCGG CGTTTTCAGC GTCGGCTCCG GCAGTAGTGG CGGCGTCGGC 2040 

GTTTCCAACG ACAATCACGG CGCCGGCGGT ACTGCGGCGG TTTCGTACCA GGGCGCCACG 2100 

GTGTTTGAGC CCGAGGTGGG TTACTACAAC GACCCCGTGG CCGTGTTCGA CTTTGCCAGC 2160 

CTCTACCCTT CCATCATCAT GGCCCACAAC CTCTGCTACT CCACCCTGCT GGTGCCGGGT 2220 

GGCGAGTACC CTGTGGACCC CGCCGACGTA TACAGCGTCA CGCTAGAGAA CGGCGTGACC 2280 

30 CACCGCTTTG TGCGTGCTTC GGTGCGCGTC TCGGTGCTCT CGGAACTGCT CAACAAGTGG 2340 

GTTTCGCAGC GGCGTGCCGT GCGCGAATGC ATGCGCGAGT GTCAAGACCC TGTGCGCCGT 2400 

ATGCTGCTCG ACAAGGAACA GATGGCGCTC AAAGTAACGT GCAACGCTTT CTACGGTTTT 2460 

ACCGGCGTGG TCAACGGTAT GATGCCGTGT CTGCCCATCG CCGCCAGCAT CACGCGCATC 2520 

GGTCG CGACA TGCTAGAGC G CACGGCGCGG TTCATCAAAG ACAACTTTTC AGAGCCGTGT 2580 

35 TTTTTGCACA All 1 1 1 1 IM TCAGGAAGAC TATGTAGTGG GAACGCGGGA GGGGGATTCG 2640 

GAGGAGAGCA GCGCGTTACC GGAGGGGCTC GAAACATCGT CAGGGGGCTC GAACGAACGG 2700 

CGGGTGGAGG CGCGGGTCAT CTACGGGGAC ACGGACAGCG TGTTTGTCCG CTTTCGTGGC 2760 

C TGACG CCGC AGGCTCTGGT GGCGCGTGGG CCCAGCCTGG CGCACTACGT GACGGCCTGT 2820 

CTTTTTGTGG AGCCCGTCAA GCTGGAGTTT GAAAAGGTCT TCGTCTCTCT TATGATGATC 2880 

40 TGCAAGAAAC GTTACATCGG CAAAGTGGAG GGCGCCTCGG GTCTGAGCAT GAAGGGCGTG 2940 

GATCTGGTGC GCAAGACGGC CTGCGAGTTC GTCAAGGGCG TCACGCGTGA CGTCCTCTCG 3000 

CTGCTCTTTG AGGATCGCGA GGTCTCGGAA GCAGCCGTGC GCCTGTCGCG CCTCTCACTC 3060 

GATGAAGTCA AGAAGTACGG CGTGCCACGC GGTTTCTGGC GTATCTTACG CCGCTTGGTG 3120 

CAGGCCCGCG ACGATCTGTA CCTGCACCGT GTGCGTGTCG AGGACCTGGT GCTTTCGTCG 3180 

45 GTGCTCTCTA AGGACATCTC GCTGTACCGT CAATCTAACC TGCCGCACAT TGCCGTCATT 3240 

AAGCGATTGG CGGCCCGTTC TGAGGAGCTA CCCTCGGTCG GGGATCGGGT CTTTTACGTT 3300 

CTGACGGCGC CCGGTGTCCG GACGGCGCCG CAGGGTTCCT CCGACAACGG TGATTCTGTA 3360 

ACCGCCGGCG TGGTTTCCCG GTCGGACGCG ATTGATGGCA CGGACGACGA CGCTGACGGC 3420 

GGCGGGGTAG AGGAGAGCAA CAGGAGAGGA GGAGAGCCGG CAAAGAAGAG GGCGCGGAAA 3480 

50 CCACCGTCGG CCGTGTGCAA CTACGAGGTA GCCGAAGATC CGAGCTACGT GCGCGAGCAC 3540 

GGCGTGCCCA TTCACGCCGA CAAGTACTTT GAGCAGGTTC TCAAGGCTGT AACTAACGTG 3600 

CTGTCGCCCG TCTTTCCCGG CGGCGAAACC GCGCGCAAGG ACAAGTTTTT GCACATGGTG 3660 

CTGCCGCGGC GCTTGCACTT GGAGCCGGCT TTTCTGCCGT ACAGTGTCAA GGCGCACGAA 3720 

TGCTGTTGA 3729 

55 

SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATGGATTCGG TGTCGTTTTT TMTCCATAT TTGGAAGCGA ATCGCTTAAA GAAAAAAAGC 60 

AGATCGAGTT ACATTCGTAT ACTTCCTCGC GGTATAATGC ATGATGGTGC GGCGGGATTA 120 

ATAAAGGATG TTTGTGACTC TGAACCGCGT ATGTTTTATC GAGACCGACA GTATTTACTG 180 

60 AGCAAAGAAA TGACCTGGCC GAGTTTGGAC ATAGCTCGGT CCAAGGATTA TGATCATATG 240 

AGGATGAAGT TTCACATATA TGATGCTGTA GAAACGTTAA TGTTTACGGA TTCGATCGAG 300 

MTCTTCCTT TTCAGTATAG ACATTTTGTG ATTCCTTCGG GGACAGTGAT TAGAATGTTT 360 

GGGAGAACTG AGGACGGTGA GAAGATCTGC GTGAACGTGT TTGGACAGGA GCAATATTTC 420 

TACTGCGAAT GCGTCGACGG AAGAAGCCTG MGGCTACTA TAAACAATTT GATGTTAACC 480 
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GGCGAGGTTA AAATGTCGTG TTCTTTTGTC ATTGAGCCGG CTGATAAGTT GTCGTTGTAT 540 

GGGTACAATG CCAACACTGT CGTTAATCTG TTTAAAGTGA GTTTTGGAAA TTTTTATGTA 600 

TCTCAACGTA TTGGAAAGAT TCTGCAGAAT GAGGGATTCG TAGTTTATGA AATCGACGTA 660 

GATGTTTTGA CTCGTTTCTT CGTCGATAAT GGI 1 1 1 1 IGA GTTTCGGATG GTATAATGTA 720 

5 AAAAAATATA TTCCTCAAGA TATGGGAAAA GGGAGTAATC TTGAGGTGGA AATTAATTGT 780 

CATGTCTCTG ATTTAGTTTC TCTGGAAGAC GTTMTTGGC CTTTATATGG ATGCTGGTCT 840 

TTCGACATAG AGTGTTTGGG TCAAAATGGG AATTTCCCGG ATGCCGAAAA TTTAGGTGAT 900 

ATAGTTATTC AGATTTCTGT AATTAGTTTC GATACGGAAG GTGACCGTGA TGAGCGACAT 960 

CTGTTTACTC TGGGAACATG TGAAAAAATT GACGGCGTGC ATATATATGA ATTTGCGTCA 1020 

10 GAGTTTGAAT TACTTTTGGG J 1 1 1 1 1 C ATA TTTTTAAGGA TTGAGTCTCC GGAGTTTATT 1080 

ACCGGTTATA ATATTAATAA TTTTGATTTA AAATATTTGT GTATAAGGAT GGATAAGATT 1140 

TACCATTATG ATATTGGTTG TTTTTCGAAA CTGAAGAATG GAAAGATTGG AATCTCTGTC 1200 

CCTCACGAAC AGTACAGGAA GGGGTTCCTT CAGGCGCAAA CCAAGGTGTT TACTTCCGGA 1260 

GTGTTGTATC TGGATATGTA TCCCGTCTAT TCTAGTAAGA TAACGGCGCA GAATTACAAA 1320 

15 CTGGATACTA TTGCTAAGAT CTGTCTCCAG CAAGAAAAGG AGCAGTTATC GTACAAGGAA 1380 

ATACCAAAGA AATTTATTAG TGGACCCAGT GGCAGGGCTG TTGTCGGTAA GTATTGTCTA 1440 

CAGGACTCTG TCTTAGTTGT GCGTCTCTTT AAACAGATTA ATTATCATTT TGAGGTTGCC 1500 

GAGGTCGCCA GATTGGCACA CGTCACGGCT AGATGTGTGG TGTTCGAGGG TCAGCAGAAG 1560 

AAGATATTTC CCTGCATTCT TACGGAAGCA AAACGCCGTA ATATGATTCT TCCGAGTATG 1620 

20 GTGTCTTCGC ACAATAGACA AGGGATAGGT TACAAAGGGG CTACCGTTTT GGAGCCTAAG 1680 

ACGGGTTATT ATGCTGTGCC TACCGTGGTG TTTGATTTTC AAAGTTTGTA TCCGAGCATT 1740 

ATGATGGCGC ATAATCTGTG TTATAGTACT TTAGTTTTGG ATGAACGACA GATAGCTGGA 1800 

TTGTCAGAGA GTGACATCTT AACCGTGAAG TTGGGGGATG AGACTCATCG GTTTGTGAAG 1860 

CCTTGTATCC GTGAGTCTGT GCTTGGGAGT CTACTAAAGG ACTGGCTGGC CAAGAGACGA 1920 

25 GAAGTGAAGG CGGAGATGCA GAACTGTTCG GATCCGATGA TGAAACTTCT TCTGGATAAA 1980 

AAGCAGCTCG CTCTGAAAAC AACATGTAAC TCGGTGTACG GTGTCACGGG AGCGGCGCAC 2040 

GGGTTATTGC CGTGTG7TGC GATTGCTGCT TCTGTAACGT GTCTTGGAAG AGAGATGCTT 2100 

TGTTCCACGG TGGATTATGT TAATTCCAAG ATGCAGTCCG AGCAATTTTT TTGCGAGGAA 2160 

TTTGGTTTAA CGTCATCAGA TTTTACTGGT GATTTGGAAG TGGAGGTAAT TTATGGTGAT 2220 

30 ACGGATAGCA TCTTTATGTC TGTCAGAAAT ATGGTTAATC AGTCTCTGCG AAGGATTGCG 2280 

CCGATGATCG CCAAACATAT CACAGATCGT CTGTTCAAGT CGCCTATCAA GCTCGAGTTT 2340 

GAAAAGATTT TA TGTCCG CT AATTTTGATT TGTAAAAAAA GATACATTGG TAGACAGGAT 2400 

GATTCGCTTT TAATTTTTAA GGGGGTAGAT CTGGTGAGAA AGACTTCTTG CGATTTTGTG 2460 

AAGGGTGTGG TGAAAGATAT CGTGGACTTG TTGTTCTTTG ATGAAGAGGT TCAGACTGCT 2520 

35 GCTGTGGAGT TTTCTCACAT GACACAGACA CAGTTGCGTG AACAAGGAGT GCCTGTGGGT 2580 

ATTCATAAAA TTTTGCGTCG TCTGTGCGAA GCGCGGGAGG AGCTTTTTCA GAATCGGGCA 2640 

GACGTGAGAC ATTTAATGTT GTCCTCTGTG CTTTCCAAAG AAATGGCTGC ATATAAGCAA 2700 

CCGAATCTGG CTCACCTTAG CGTCATTAGA AGGTTGGCGC AGAGAAAGGA AGAAATTCCG 2760 

AATGTAGGTG ACCGAATTAT GTACGTGTTA ATAGCACCAT CTATTGGCAA TAAACAGACG 2820 

40 CATAACTATG AATTAGCAGA AGATCCCAAC TATGTGATAG AACACAAGAT TCCTATACAT 2880 

GCGGAGAAGT ATTTCGATCA GATTATCAAG GCTGTGACTA ATGCGATCTC ACCCATTTTT 2940 

CCGAAAACCG ATATAAAAAA AGAGAAGTTA CTATTGTATT TACTTCCTAT GAAAGTGTAT 3000 

TTGGATGAAA CATTTTCTGC TATTGCAGAG GTAATGTGA 3039 

45 SEQUENCE DESCRIPTION: SEQ ID N0:28: 

ATGGCGATCA GAACGGGGTT TTGTAATCCC TTTTTAACCC AAGCATCAGG GATTAAATAT 60 

AACCCAAGAA CC GGGCG CGG TAGTAACAGA GAATTTCTTC ATAGTTACAA AACTACCATG 120 

TCATCGTTTC AATTTTTGGC CCCTAAATGT TTAGATGAAG ATGTGCCCAT GGAAGAACGA 180 

AAGGGGGTTC ACGTCGGTAC ACTTAGTCGA CCGCCTAAAG TTTACTGTAA TGGAAAAGAA 240 

50 GTTCCGATTC TGGATTTTCG TTGTTCCAGC CCCTGGCCTA GACGCGTGAA TATTTGGGGG 300 

GAAATCGACT TTCGTGGGGA TAAGTTTGAC CCCCGCTTTA ACACATTCCA TGTATATGAT 360 

ATTGTCGAAA CAACAGAAGC CGCGTCTAAT GGAGATGTAT CCCGGTTTGC AACTGCAACA 420 

CGACCGCTTG GTACCGTTAT TACTTTACTT GGCATGTCCC GATGTGGAAA AAGGGTGGCA 480 

GTTCATGTAT ACGGCATCTG TCAATATTTT TATATAAACA AAGCCGAGGT GGATACCGCT 540 

55 TGTGGCATAC GTTCCGGTAG CGAGTTATCT GTATTACTTG CCGAGTGTTT ACGCAGTTCT 600 

ATGATAACAC AAAATGATGC AACGTTAAAT GGAGACAAGA ACGCTTTTCA TGGTACCTCG 660 

TTTAAAAGCG CATCTCCAGA AAGCTTTCGC GTTGAGGTTA TTGAGCGCAC AGATGTTTAT 720 

TACTACGATA CACAGCCATG TGCGTTTTAC AGGGTGTATT CTCCCTCATC TAAATTTACA 780 

AATTATCTTT GTGATAACTT TCACCCGGAG TTGAAAAAGT ATGAAGGTCG GGTAGACGCT 840 

60 ACCACTCGTT TTCTAATGGA TAATCCCGGC TTTGTTAGTT TTGGTTGGTA TCAACTAAAA 900 

CCTGGAGTTG ATGGGGAACG TGTTCGAGTT CGACCGGCAA GTCGCCMTT AACGTTAAGC 960 

GACGTTGAAA TTGACTGCAT GTCGGATAAT CTGCAGGCTA TACCAAACGA TGACTCATGG 1020 

CCTGACTACA AGTTGTTATG TTTCGATATT GAATGTAAAT CAGGAGGATC TAATGAGCTG 1080 

GCGTTTCCCG ATGCAACACA TCTGGAGGAT CTTGTAATCC AAATTTCTTG TCTATTATAT 1140 
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TCAATCCCTC GACAGTCTTT AGAACACATT TTACTGTTTT CCCTTGGCTC TTGTGACTTA 1200 

CCACAAAGGT ATGTACAAGA AATGAAGGAC GCGGGGTTAC CGGAGCCGAC TGTGCTGGAG 1260 

TTTGATAGTG AATTCGAGCT ATTAATTGCA TTTATGACCC TCGTAAAACA GTACGCTCCC 1320 

GAGTTTGCCA CAGGTTATAA CATTGTTAAT TTTGATTGGG CGTTTATTAT GGAGAAACTT 1380 

5 AATTCTATAT ACAGTCTCAA GCTTGATGGT TATGGCAGTA TAAACCGTGG GGGTCTGTTT 1440 

AAGATATGGG ATGTTGGCAA ATCCGGATTT CAGCGACGAA GCAAGGTAAA GATCAACGGT 1500 

CTCATATCTC TGGATATGTA TGCAATTGCA ACTGAAAAAT TAAAACTCTC GAGTTATAAA 1560 

TTAGATTCGG TTGCACGTGA AGCTCTAAAT GAGTCCAAGA GAGATTTGCC CTACAAAGAC 1620 

ATTCCGGGAT ATTACGCTAG TGGACCGAAT ACACGAGGAA TTATTGGTGA ATATTGTATA 1680 

10 CAAGACTCGG CTCTTGTGGG GAAACTGTTT TTTAAATATT TACCACACCT TGAGTTATCC 1740 

GCGGTTGCAA GGCTAGCTAG AATTACTTTA ACCAAGGCTA TTTACGACGG ACAGCAGGTT 1800 

AGGATTTACA CCTGTTTATT AGGACTGGCT TCGTCTCGAG GATTTATTTT ACCCGATGGG 1860 

GGATACCCAG CTACTTTTGA ATATAAGGAT GTTATTCCCG ATGTCGGGGA TGTTGAGGAA 1920 

GAGATGGATG AAGACGAGAG CGTTTCTCCC ACTGGTACGT CAAGTGGGCG AAATGTAGGA 1980 

15 TATAAAGGAG CCAGGGTTTT TGACCCTGAT ACGGGATTTT ATATCGATCC GGTGGTCGTA 2040 

TTGGATTTTG CAAGTTTATA TCCAAGTATA ATTCAGGCCC ATAACTTATG TTTTACCACG 2100 

CTAACGTTAA ATTTTGAGAC GGTTAAACGT TTGAATCCAT CCGATTATGC CACCTTTACA 2160 

GTTGGAGGAA AACGTCTTTT TTTTGTGCGC TCTAACGTTC GAGAAAGTCT GCTGGGTGTT 2220 

CTTTTAAAAG ACTGGTTGGC TATGCGCAAG GCTATTAGAG CGCGCATACC CGGAAGTTCT 2280 

20 TCAGATGAAG CAGTGTTATT AGACAAACAA CAAGCCGCGA TAAAAGTAGT TTGTAATTCC 2340 

GTGTACGGTT TTACTGGAGT TGCGCAGGGA TTTCTGCCAT GTTTATACGT AGCGGCCACT 2400 

GTCACTACAA TTGGCCGTCA AATGTTATTA AGTACCAGAG ATTATATTCA TAATAACTGG 2460 

GCCGCATTTG AACGTTTTAT TACAGCGTTT CCAGACATTG AAAGTAGCGT TCTCTCCCAA 2520 

AAAGCGTACG AGGTAAAGGT TATATATGGA GATACGGATT CTGTGTTTAT CCGATTCAAG 2580 

25 GGTGTTAGTG TTGAGGGGAT AGCTAAAATC GGCGAGAAAA TGGCACATAT AATTTCAACG 2640 

GCTCTGTTTT GTCCTCCTAT AAAGTTGGAG TGTGAAAAAA CTTTTATAAA ACTTTTGCTT 2700 

ATAACAAAGA AAAAGTACAT TGGGGTAATT TACGGCGGAA AGGTTTTAAT GAAGGGAGTC 2760 

GACTTGGTTA GAAAAAACAA CTGTCAATTT ATTAACGATT ATGCCCGCAA ACTTGTAGAA 2820 

CTGTTGTTAT ATGACGACAC CGTCTCGCGT GCTGCGGCGG AGGCGTCGTG TGTTTCCATT 2880 

30 GCTGAATGGA ATAGACGGGC CATGCCGTCT GGGATGGCCG GGTTTGGACG CATAATTGCA 2940 

GATGCACATC GCCAGATTAC ATCACCCAAA TTGGATATTA ATAAGTTTGT TATGACGGCC 3000 

GAGCTTAGTC GTCCACCATC CGCCTACATA AACCGTCGCT TGGCTCACTT AACAGTATAT 3060 

TATAAATTAG TAATGAGACA GGGTCAAATC CCAAACGTTC GAGAACGCAT CCCTTATGTT 3120 

ATTGTGGCCC CCACAGACGA AGTGGAGGCT GATGCAAAAA GTGTAGCTTT GCTACGTGGA 3180 

35 GATCCTTTAC AGAATACCGC AGGTAAACGG TGTGGGGAAG CAAAGCGTAA GTTAATAATA 3240 

TCTG ACTTAG CGGAAGATCC CATTCACGTA ACATCACACG GGCTGTCTTT AAACATTGAC 3300 

TATTATTTTT CTCATCTCAT TGGGACGGCG AGTGTAACTT TTAAGGCGTT ATTTGGAAAC 3360 

GACACTAAAC TCACAGAACG GCTTTTAAAA CGTTTTATTC CAGAGACACG AGTTGTTAAC 3420 

G7TAAAATGC TAAACCGCTT GCAGGCGGCA GGCTTTGTTT GTATACACGC CCCGTGCTGG 3480 

40 GATAATAAAA TGAACACTGA AGCTGAAATC ACCGAGGAGG AACAAAGTCA TCAAATAATG 3540 

CGTAGAGTCT TTTGTATTCC AAAAGCAATT CTCCATCAAA GTTAA 3585 

SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATGTTTTCCG GTGGCGGCGG CCCGCTGTCC CCCGGAGGAA AGTCGGCGGC CAGGGCGGCG 60 

45 TCCGGGTTTT TTGCGCCCGC CGGCCCTCGC GGAGCCGGCC GGGGACCCCC GCCTTGCTTG 120 

AGGCAAAACT TTTACAACCC CTACCTCGCC CCAGTCGGGA CGCAACAGAA GCCGACCGGG 180 

CCAACCCAGC GCCATACGTA CTATAGCGAA TGCGATGAAT TTCGATTCAT CGCCCCGCGG 240 

GTGCTGGACG AGGATGCCCC CCCGGAGAAG CGCGCCGGGG TGCACGACGG TCACCTCAAG 300 

CGCGCCCCCA AGGTGTACTG CGGGGGGGAC GAGCGCGACG TCCTCCGCGT CGGGTCGGGC 360 

50 GGCTTCTGGC CGCGGCGCTC GCGCCTGTGG GGCGGCGTGG ACCACGCCCC GGCGGGGTTC 420 

AACCCCACCG TCACCGTCTT TCACGTGTAC GACATCCTGG AGAACGTGGA GCACGCGTAC 480 

GGCATGCGCG CGGCCCAGTT CCACGCGCGG TTTATGGACG CCATCACACC GACGGGGACC 540 

GTCATCACGC TCCTGGGCCT GACTCCGGAA GGCCACCGGG TGGCCGTTCA CGTTTACGGC 600 

ACGCGGCAGT ACTTTTACAT GAACAAGGAG GAGGTCGACA GGCACCTACA ATGCCGCGCC 660 

55 CCACGAGATC TCTGCGAGCG CATGGCCGCG GCCCTGCGCG AGTCCCCGGG CGCGTCGTTC 720 

CGCGGCATTT CCGCGGACCA CTTCGAGGCG GAGGTGGTGG AGCGCACCGA CGTGTACTAC 780 

TACGAGACGC GCCCCGCTCT GTTTTACCGC GTCTACGTCC GAAGCGGGCG CGTGCTGTCG 840 

TACCTGTGCG ACAACTTCTG CCCGGCCATC AAGAAGTACG AGGGTGGGGT CGACGCCACC 900 

ACCCGGTTCA TCCTGGACAA CCCCGGGTTC GTCACCTTCG GCTGGTACCG TCTCAAACCG 960 

60 GGCCGGAACA ACACGCTAGC CCAGCCGCGG GCCCCGATGG CCTTCGGGAC ATCCAGCGAC 1020 

GTCGAGTTTA ACTGTACGGC GGACAACCTG GCCATCGAGG GGGGCATGAG CGACCTACCG 1080 

GCATACAAGC TCATGTGCTT CGATATCGAA TGCAAGGCGG GGGGGGAGGA CGAGCTGGCC 1140 

TTTCCGGTGG CCGGGCACCC GGAGGACCTG GTCATCCAGA TATCCTGTCT GCTCTACGAC 1200 

CTGTCCACCA CCGCCCTGGA GCACGTCCTC CTGTTTTCGC TCGGTTCCTG CGACCTCCCC 1260 
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GAATCCCACC T6AACGAGCT GGCGGCCAGG 
GACAGCGAAT TCGAGATGCT GTTGGCCTTC 
TTCGTGACCG GGTACAACAT CATCAACTTC 
GACATTTACA AGGTCCCCCT GGACGGGTAC 
5 GTGTGGGACA TAGGCCAGAG CCACTTCCAG 
GTGAGCATCG ACATGTACGG GATTATAACC 
AACGCCGTGG CCGAAGCCGT CCTGAAGGAC 
CCCGCCTACT ACGCCGCCGG GCCCGCGCAA 
GATTCCCTGC TGGTGGGCCA GCTG UH1 I 

1 0 GTCGCGCGCT TGGCGGGTAT TAACATCACC 
GTCTTTACGT GCCTGCTGCG CCTGGCCGAC 
GGGCGATTTA GGGGCGCCGG GGGGGAGGCG 
GAGGAGCGGC CAGAGGAGGA GGGGGAGGAC 
CGGGAGCCGG ACGGCGCGCG GGAGACCGCC 

15 GTCCTTGACC CCACTTCCGG GTTTCACGTG 
CTGTACCCCA GCATCATCCA GGCCCACAAC 
GACGCAGTGG CGCACCTGGA GGCGGGCAAG 
CGGCTGTTCT TCGTCAAGGC TCACGTGCGA 
TGGCTCGCCA TGCGAAAGCA GATCCGCTCG 

20 GTGCTCCTGG ACAAGCAGCA GGCCGCCATC 
ACGGGAGTGC AGCACGGACT CCTGCCGTGC 
GGCCGCGAGA TGCTGCTCGC GACCCGCGAG 
CAGCTCCTGG CCGATTTCCC GGAGGCGGCC 
CGCATCATCT ACGGGGACAC GGACTCCATA 

25 GGGCTGACGG CCATGGGCGA CAAGATGGCG 
CCCATCAAAC TCGAGTGCGA AAAGACGTTC 
TACATCGGCG TCATCTACGG GGGTAAGATG 
AACAACTGCG CGTTTATCAA CCGCACCTCC 
GATACCGTCT CCGGAGCGGC CGCCGCGTTA 

30 CGACCCCTGC CCGAGGGACT GCAGGCGTTC 
ATCACCGACC CGGAGAGGGA CATCCAGGAC 
CCGCGCGCGT ACACCAACAA GCGCCTGGCC 
CGCCGCGCGC AGGTCCCGTC CATCAAGGAC 
CGCGAGGTAG AGGAGACGGT CGCGCGGCTG 

35 CCAGGGGACG AGCCCGCCCC CCCCGCGGCC 
ACGCCGTCGC CTGCCGACCC CCCGGGAGGC 
GAGCTGGCCG AGGATCCCGC ATACGCCATT 
TACTTCTCCC ACCTGTTGGG GGCGGCGTGC 
GCCAAGATCA CCGAGAGTCT GTTAAAAAGG 

40 GACGTGACCG CGCGGCTCCG GGCCGCAGGG 
GAGGAAACTC GTCGAATGTT GCATAGAGCC 



GGCCTGCCCA CGCCCGTGGT TCTGGAATTC 1320 

ATGACCCTTG TGAAACAGTA CGGCCCCGAG 1380 

GACTGGCCCT TCTTGCTGGC CAAGCTGACG 1440 

GGCCGCATGA ACGGCCGGGG CGTGTTTCGC 1500 

AAGCGCAGCA AGATAAAGGT GAACGGCATG 1560 

GACAAGATCA AGCTCTCGAG CTACAAGCTC 1620 

AAGAAGAAGG ACCTGAGCTA TCGCGACATC 1680 

CGCGGGGTGA TCGGCGAGTA CTGCATACAG 1740 

AAGTTTTTGC CCCATCTGGA GCTCTCGGCC 1800 

CGCACCATCT ACGACGGCCA GCAGATCCGC 1860 

CAGAAGGGCT TTATTCTGCC GGACACCCAG 1920 

CCCAAGCGTC CGGCCGCAGC CCGGGAGGAC 1980 

GAGGACGAAC GCGAGGAGGG CGGGGGCGAG 2040 

GGCCGGCACG TGGGGTACCA GGGGGCCAGG 2100 

AACCCCGTGG TGGTGTTCGA CTTTGCCAGC 2160 

CTGTGCnCA GCACGCTCTC CCTGAGGGCC 2220 

GACTACCTGG AGATCGAGGT GGGGGGGCGA 2280 

GAGAGCCTCC TCAGCATCCT CCTGCGGGAC 2340 

CGGATTCCCC AGAGCAGCCC CGAGGAGGCC 2400 

AAGGTCGTGT GTAACTCGGT GTACGGGTTC 2460 

CTGCACGTTG CCGCGACGGT GACGACCATC 2520 

TACGTCCACG CGCGCTGGGC GGCCTTCGAA 2580 

GACATGCGCG CCCCCGGGCC CTATTCCATG 2640 

TTTGTGCTGT GCCGCGGCCT CACGGCCGCC 2700 

AGCCACATCT CGCGCGCGCT GTTTCTGCCC 2760 

ACCAAGCTGC TGCTGATCGC CAAGAAAAAG 2820 

CTCATCAAGG GCGTGGATCT GGTGCGCAAA 2880 

AGGGCCCTGG TCGACCTGCT GTTTTACGAC 2940 

GCCGAGCGCC CCGCAGAGGA GTGGCTGGCG 3000 

GGGGCCGTCC TCGTAGACGC CCATCGGCGC 3060 

TTTGTCCTCA CCGCCGAACT GAGCAGACAC 3120 

CACCTGACGG TGTATTACAA GCTCATGGCC 3180 

CGGATCCCGT ACGTGATCGT GGCCCAGACC 3240 

GCCGCCCTCC GCGAGCTAGA CGCCGCCGCC 3300 

CTGCCCTCCC CGGCCAAGCG CCCCCGGGAG 3360 

GCGTCCAAGC CCCGCAAGCT GCTGGTGTCC 3420 

GCCCACGGCG TCGCCCTGAA CACGGACTAT 3480 

GTGACATTCA AGGCCCTGTT TGGGAATAAC 3540 

TTTATTCCCG AAGTGTGGCA CCCCCCGGAC 3600 

TTCGGGGCGG TGGGTGCCGG CGCTACGGCG 3660 

TTTGATACTC TAGCATGA 3708 



45 



SEQUENCE DESCRIPTION: SEQ ID NO: 73 
He Ala Glu Thr Val Thr Leu 

SEQUENCE DESCRIPTION: SEQ ID NO: 74 
Val Ala Ser Gly He Leu Pro 



SEQUENCE DESCRIPTION: SEQ ID NO: 75 
50 Gly He Leu Pro Cys Leu Asn 



SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
Cys Leu Asn He Ala Glu Thr 



55 SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
Gl n Gly Arg Lys Met Leu Gl u 



60 



SEQUENCE DESCRIPTION: SEQ ID NO: 78 
Ser Gin Ala Phe Val Glu 

SEQUENCE DESCRIPTION: SEQ ID NO: 79; 
Ala Arg Phe Lys Val lie 

SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
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Leu Glu Thr Ser Gin Ala Phe 

SEQUENCE DESCRIPTION: SEQ ID N0:81: 
Leu Glu Arg Ser Gin Ala Phe 

5 

SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
Glu Gly He Ser Pro Thr Ala 

SEQUENCE DESCRIPTION: SEQ ID N0:83: 
10 Glu Ala He Ser Pro Glu Arg 

SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
Ala Asp Leu Leu Gin Arg Pro 

15 SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
Ala Gly Leu Leu Arg Arg Pro 



20 



35 



50 



SEQUENCE DESCRIPTION: SEQ ID N0:86; 
Gin Arg Pro He Glu Ala Ser 

SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
Arg Arg Pro He Asp Val Ser 



SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
25 He Glu Ala Ser Pro Glu Ala 

SEQUENCE DESCRIPTION: SEQ ID N0:89: 
He Asp Val Ser Pro Asp Ala 

30 SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
Pro Asp Asp Tyr Glu Thr Phe 



SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
Lys Arg Lys Glu He Arg Lys 

SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
Leu Ala Lys Arg Lys Glu He 



SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
40 Leu Ala Ser Cys Thr Asp Pro 

SEQUENCE DESCRIPTION: SEQ ID N0:94: 
Thr Gly Ser Ala Leu His Gly 

45 SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
Pro Gly Asp Ser Leu His Leu 



SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
Ser Ala Leu His Gly His Pro 

SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
Asp Ser Leu His Leu His Pro 



SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

55 Gly His Pro Glu Leu Thr Pro 

SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Leu His Pro His Leu Gly Pro 

60 SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

His Leu Ser Gly Gly Thr Val 

SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Val Leu Ser Gly Gly Leu Val 
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10 



25 



35 



55 



60 



SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
Thr Asp Pro Thr Met Arg Thr 

SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
Thr Asp Pro A1 a Leu Lys Thr 

SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CAGTATCATC CAAGCGCACA A 21 
SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CCAAGTATCA THCARGCNCA YAA 23 



SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
15 GGAGTAGCAC AARTTRTGNG CYTG 24 

SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

TACGAAACCT TTGACCTNAG YGGNGG 26 

20 SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CGCAAGAACC TGGCCTCNTG YACNGAYCC 29 



SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

TCTCAGGCGT TCGTAGARGG NATHTCNCC 29 
SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

CAGCTGGCCA TCAAGGTCAC 20 



SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
30 AACGCGGTGT ACGGGTTCAC 20 

SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Arg Thr He Leu Asp Lys Gin Gin Leu Ala He Lys Val Thr Cys Asn 
Ala Val Tyr Gly Phe Thr Gly Val Ala Ser Gly lie Leu Pro Cys Leu 



SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Ser He He Gin Ala His Asn Leu Cys Tyr Ser Thr Leu He Pro 



SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

40 CGTTGCCTCT GGCATACTGC CTTGCCTAAA CATAGCGGAG ACCGTGACAC TACAAGGGCG 60 

AAAGATGCTG GAGAGATCTC AGGCCTTTGT AGAGGCCATC TCGCCGGAAC GCCTAGCGGG 120 

TCTCCTGCGG AGGCCAATAG ACGTCTCACC CGACGCCCGA TTCAAGGTCA TA 172 

SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
45 Val Ala Ser Gly He Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr 

Leu Gin Gly Arg Lys Met Leu Glu Arg Ser Gin Ala Phe Val Glu Ala 
He Ser Pro Glu Arg Leu Ala Gly Leu Leu Arg Arg Pro He Asp Val 
Ser Pro Asp Ala Arg Phe Lys Val He 

50 



SEQUENCE DESCRIPTION: SEQ ID NO 
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GAC 


GAC 


CGC 


AGC 


GTG 


TGC 


GTG 


AAY 


GTN 


TTY 


GGN 


CAG 


CGC 


TGC 


TAC 


TTC 


48 


Asp 


Asp 


Arg 


Ser 


Val 


Cys 


Val 


Asn 


Val 


Phe 


Gly 


Gin 


Arg 


Cys 


Tyr 


Phe 




TAC 


ACA 


CTA 


GCA 


CCC 


CAG 


GGG 


GTA 


AAC 


CTG 


ACC 


CAC 


GTC 


CTC 


CAG 


CAG 


96 


Tyr 


Thr 


Leu 


Ala 


Pro 


Gin 


Gly 


Val 


Asn 


Leu 


Thr 


His 


Val 


Leu 


Gin 


Gin 




GCC 


CTC 


CAG 


GCT 


GGC 


TTC 


GGT 


CGC 


GCA 


TCC 


TGC 


GGC 


TTC 


TCC 


ACC 


GAG 


144 


Ala 


Leu 


Gin 


Ala 


Gly 


Phe 


Gly 


Arg 


Ala 


Ser 


Cys 


Gly 


Phe 


Ser 


Thr 


Glu 




CCG 


GTC 


AGA 


AAA 


AAA 


ATC 


TTG 


CGC 


GCG 


TAC 


GAC 


ACA 


CAA 


CAA 


TAT 


GCT 


192 


Pro 


Val 


Arg 


Lys 


Lys 


He 


Leu 


Arg 


Ala 


Tyr 


Asp 


Thr 


Gin 


Gin 


Tyr 


Ala 




GTG 


CAA 


AAA 


ATA 


ACC 


CTG 


TCA 


TCC 


AGT 


CCG 


ATG 


ATG 


CGA 


ACG 


CTT 


AGC 


240 


Val 


Gin 


Lys 


He 


Thr 


Leu 


Ser 


Ser 


Ser 


Pro 


Met 


Met 


Arg 


Thr 


Leu 


Ser 




GAC 


CGC 


CTA 


ACA 


ACC 


TGT 


GGG 


TGC 


GAG 


GTG 


TTT 


GAG 


TCC 


AAT 


GTG 


GAC 


288 


Asp 


Arg 


Leu 


Thr 


Thr 


Cys 


Gly 


Cys 


Glu 


Val 


Phe 


Glu 


Ser 


Asn 


Val 


Asp 




GCC 


ATT 


AGG 


CGC 


TTC 


GTG 


CTG 


GAC 


CAC 


GGG 


TTC 


TCG 


ACA 


TTC 


GGG 


TGG 


336 


Ala 


He 


Arg 


Arg 


Phe 


Val 


Leu 


Asp 


His 


Gly 


Phe 


Ser 


Thr 


Phe 


Gly 


Trp 
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TAC GAG TGC AGC AAC CCG GCC CCC CGC ACC CAG GCC AGA GAC TCT TGG 384 
Tyr Glu Cys Ser Asn Pro Ala Pro Arg Thr Gin Ala Arg Asp Ser Trp 
ACG GAA CTG GAG TTT GAC TGC AGC TGG GAG GAC CTA AAG TTT ATC CCG 432 
Thr Glu Leu Glu Phe Asp Cys Ser Trp Glu Asp Leu Lys Phe He Pro 
5 GAG AGG ACG GAG TGG CCC CCA TAC ACA ATC CTA TCC TTT GAT ATA GAA 480 
Glu Arg Thr Glu Trp Pro Pro Tyr Thr He Leu Ser Phe Asp He Glu 
TGT ATG GGC GAG AAG GGT TTT CCC AAC GCG ACT CAA GAC GAG GAC ATG 528 
Cys Met Gly Glu Lys Gly Phe Pro Asn Ala Thr Gin Asp Glu Asp Met 
ATT ATA CAA ATC TCG TGT GTT TTA CAC ACA GTC GGC AAC GAT AAA CCG 576 

10 He He Gin lie Ser Cys Val Leu His Thr Val Gly Asn Asp Lys Pro 

TAC ACC CGC ATG CTA CTG GGC CTG GGG ACA TGC GAC CCC CTT CCT GGG 624 
Tyr Thr Arg Met Leu Leu Gly Leu Gly Thr Cys Asp Pro Leu Pro Gly 
GTG GAG GTC TTT GAG TTT CCT TCG GAG TAC GAC ATG CTG GCC GCC TTC 672 
Val Glu Val Phe Glu Phe Pro Ser Glu Tyr Asp Met Leu Ala Ala Phe 

15 CTC AGC ATG CTC CGC GAT TAC AAT GTG GAG TTT ATA ACG GGG TAC AAC 720 
Leu Ser Met Leu Arg Asp Tyr Asn Val Glu Phe He Thr Gly Tyr Asn 
ATA GCA AAC TTT GAC CTT CCA TAC ATC ATA GCC CGG GCA ACT CAG GTG 768 
He Ala Asn Phe Asp Leu Pro Tyr He He Ala Arg Ala Thr Gin Val 
TAC GAC TTC AAG CTG CAG GAC TTC ACC AAA ATA AAA ACT GGG TCC GTG 816 

20 Tyr Asp Phe Lys Leu Gin Asp Phe Thr Lys He Lys Thr Gly Ser Val 

TTT GAG GTC CAC CAA CCC AGA GGC GGT TCC GAT GGG GGC AAC TTC ATG 864 
Phe Glu Val His Gin Pro Arg Gly Gly Ser Asp Gly Gly Asn Phe Met 
AGG TCC CAG TCA AAG GTC AAA ATA TCG GGG ATC GTC CCC ATA GAC ATG 912 
Arg Ser Gin Ser Lys Val Lys He Ser Gly He Val Pro He Asp Met 

25 TAC CAG GTT TGC AGG GAA AAG CTG AGT CTG TCA GAC TAC AAG CTG GAC 960 
Tyr Gin Val Cys Arg Glu Lys Leu Ser Leu Ser Asp Tyr Lys Leu Asp 
ACA GTG GCT AAG CAA TGC CTC GGT CGA CAA AAA GAT GAC ATC TCA TAC 1008 
Thr Val Ala Lys Gin Cys Leu Gly Arg Gin Lys Asp Asp He Ser Tyr 
AAG GAC ATA CCC CCG CTT TTT AAA TCT GGG CCT GAT GGT CGC GCA AAG 1056 

30 Lys Asp He Pro Pro Leu Phe Lys Ser Gly Pro Asp Gly Arg Ala Lys 

GTG GGA AAC TAC TGT GTT ATT GAC TCG GTC CTG GTT ATG GAT CTT CTG 1104 
Val Gly Asn Tyr Cys Val He Asp Ser Val Leu Val Met Asp Leu Leu 
CTA CGG TTT CAG ACC CAT GTT GAG ATC TCG GAA ATA GCC AAG CTG GCC 1152 
Leu Arg Phe Gin Thr His Val Glu He Ser Glu He Ala Lys Leu Ala 

35 AAG ATC CCC ACC CGT AGG GTA CTG ACG GAC GGC CAA CAG ATC AGG GTA 1200 
Lys He Pro Thr Arg Arg Val Leu Thr Asp Gly Gin Gin He Arg Val 
TTT TCC TGC CTC TTG GAG GCT GCT GCC ACG GAA GGT TAC ATT CTC CCC 1248 
Phe Ser Cys Leu Leu Glu Ala Ala Ala Thr Glu Gly Tyr He Leu Pro 
GTC CCA AAA GGA GAC GCG GTT AGC GGG TAT CAG GGG GCC ACT GTA ATA 1296 

40 Val Pro Lys Gly Asp Ala Val Ser Gly Tyr Gin Gly Ala Thr Val He 

AGC CCC TCT CCG GGA TTC TAT GAC GAC CCC GTA CTC GTG GTG GAT TTT 1344 
Ser Pro Ser Pro Gly Phe Tyr Asp Asp Pro Val Leu Val Val Asp Phe 
GCC AGC TTG TAC CCC AGT ATC ATC CAA GCG CAC AAC TTG TGC TAC TCC 1392 
Ala Ser Leu Tyr Pro Ser He He Gin Ala His Asn Leu Cys Tyr Ser 

45 ACA CTG ATA CCC GGC GAT TCG CTC CAC CTG CAC CCA CAC CTC TCC CCG 1440 
Thr Leu He Pro Gly Asp Ser Leu His Leu His Pro His Leu Ser Pro 
GAC GAC TAC GAA ACC TTT GTC CTC AGC GGA GGT CCG GTC CAC TTT GTA 1488 
Asp Asp Tyr Glu Thr Phe Val Leu Ser Gly Gly Pro Val His Phe Val 
AAA AAA CAC AAA AGG GAG TCC CTT CTT GCC AAG CTT CTG ACG GTA TGG 1536 

50 Lys Lys His Lys Arg Glu Ser Leu Leu Ala Lys Leu Leu Thr Val Trp 

CTC GCG AAG AGA AAA GAA ATA AGA AAG ACC CTG GCA TCA TGC ACG GAC 1584 
Leu Ala Lys Arg Lys Glu He Arg Lys Thr Leu Ala Ser Cys Thr Asp 
CCC GCA CTG AAA ACT ATT CTA GAC AAA CAA CAA CTG GCC ATC AAG GTT 1632 
Pro Ala Leu Lys Thr He Leu Asp Lys Gin Gin Leu Ala He Lys Val 

55 ACC TGC AAC GCC GTT TAC GGC TTC ACG GGC GTT GCC TCT GGC ATA CTG 1680 
Thr Cys Asn Ala Val Tyr Gly Phe Thr Gly Val Ala Ser Gly He Leu 
CCT TGC CTA AAC ATA GCG GAG ACC GTG ACA CTA CAA GGG CGA AAG ATG 1728 
Pro Cys Leu Asn lie Ala Glu Thr Val Thr Leu Gin Gly Arg Lys Met 
CTG GAG AGA TCT CAG GCC TTT GTA GAG GCC ATC TCG CCG GAA CGC CTA 1776 

60 Leu Glu Arg Ser Gin Ala Phe Val Glu Ala He Ser Pro Glu Arg Leu 

GCG GGT CTC CTG CGG AGG CCA GTA GAC GTC TCA CCC GAC GCC CGA TTC 1824 
Ala Gly Leu Leu Arg Arg Pro Val Asp Val Ser Pro Asp Ala Arg Phe 
AAG GTC ATA TAC GGC GAC ACT GAC TCT CTT TTC ATA TGC TGC ATG GGT 1872 
Lys Val He Tyr Gly Asp Thr Asp Ser Leu Phe He Cys Cys Met Gly 
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TTC AAC ATG GAC AGC GTG TCA GAC TTC GCG GAG GAG CTA GCG TCA ATC 1920 
Phe Asn Met Asp Ser Val Ser Asp Phe Ala Glu Glu Leu Ala Ser lie 
ACC ACC AAC ACG CTG TTT CGT AGC CCC ATC AAG CTG GAG GCT GAA AAG 1968 
Thr Thr Asn Thr Leu Phe Arg Ser Pro He Lys Leu Glu Ala Glu Lys 
5 ATC TTC AAG TGC CTT CTG CTC CTG ACT AAA AAG AGA TAC GTG GGG GTA 2016 
lie Phe Lys Cys Leu Leu Leu Leu Thr Lys Lys Arg Tyr Val Gly Val 
CTC AGT GAC GAC AAG GTT CTG ATG AAG GGC GTA GAC CTC ATT AGG AAA 2064 
Leu Ser Asp Asp Lys Val Leu Met Lys Gly Val Asp Leu He Arg Lys 
ACA GCC TGT CGT TTT GTC CAG GAA AAG AGC AGT CAG GTC CTG GAC CTC 2112 

10 Thr Ala Cys Arg Phe Val Gin Glu Lys Ser Ser Gin Val Leu Asp Leu 

ATA CTG CGG GAG CCG AGC GTC AAG GCC GCG GCC AAG CTT ATT TCG GGG 2160 
lie Leu Arg Glu Pro Ser Val Lys Ala Ala Ala Lys Leu He Ser Gly 
CAG GCG ACA GAC TGG GTG TAC AGG GAA GGG CTC CCA GAG GGG TTC GTC 2208 
Gin Ala Thr Asp Trp Val Tyr Arg Glu Gly Leu Pro Glu Gly Phe Val 

1 5 AAG ATA ATT CAA GTG CTC AAC GCG AGC CAC CGG GAA CTG TGC GAA CGC 2256 
Lys He He Gin Val Leu Asn Ala Ser His Arg Glu Leu Cys Glu Arg 
AGC GTA CCA GTA GAC AAA CTG ACG TTT ACC ACC GAG CTA AGC CGC CCG 2304 
Ser Val Pro Val Asp Lys Leu Thr Phe Thr Thr Glu Leu Ser Arg Pro 
CTG GCG GAC TAC AAG ACG CAA AAC CTC CCG CAC CTG ACC GTG TAC CAA 2352 

20 Leu Ala Asp Tyr Lys Thr Gin Asn Leu Pro His Leu Thr Val Tyr Gin 

AAG CTA CAA GCT AGA CAG GAG GAG CTT CCA CAG ATA CAC GAC AGA ATC 2400 
Lys Leu Gin Ala Arg Gin Glu Glu Leu Pro Gin He His Asp Arg He 
CCC TAC GTG TTC GTC GAC GCC CCA GGT AGC CTG CGC TCC GAG CTG GCA 2448 
Pro Tyr Val Phe Val Asp Ala Pro Gly Ser Leu Arg Ser Glu Leu Ala 

25 GAG CAC CCC GAG TAC GTT AAG CAG CAC GGA CTG CGC GTG GCG GTG GAC 2496 
Glu His Pro Glu Tyr Val Lys Gin His Gly Leu Arg Val Ala Val Asp 
CTG TAT TTC GAC AAG 2511 
Leu Tyr Phe Asp Lys 

30 SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Asp Asp Arg Ser Val Cys Val Asn Val Phe Gly Gin Arg Cys Tyr Phe 
Tyr Thr Leu Ala Pro Gin Gly Val Asn Leu Thr His Val Leu Gin Gin 
Ala Leu Gin Ala Gly Phe Gly Arg Ala Ser Cys Gly Phe Ser Thr Glu 
Pro Val Arg Lys Lys He Leu Arg Ala Tyr Asp Thr Gin Gin Tyr Ala 

35 Val Gin Lys He Thr Leu Ser Ser Ser Pro Met Met Arg Thr Leu Ser 
Asp Arg Leu Thr Thr Cys Gly Cys Glu Val Phe Glu Ser Asn Val Asp 
Ala He Arg Arg Phe Val Leu Asp His Gly Phe Ser Thr Phe Gly Trp 
Tyr Glu Cys Ser Asn Pro Ala Pro Arg Thr Gin Ala Arg Asp Ser Trp 
Thr Glu Leu Glu Phe Asp Cys Ser Trp Glu Asp Leu Lys Phe He Pro 

40 Glu Arg Thr Glu Trp Pro Pro Tyr Thr He Leu Ser Phe Asp He Glu 
Cys Met Gly Glu Lys Gly Phe Pro Asn Ala Thr Gin Asp Glu Asp Met 
He He Gin He Ser Cys Val Leu His Thr Val Gly Asn Asp Lys Pro 
Tyr Thr Arg Met Leu Leu Gly Leu Gly Thr Cys Asp Pro Leu Pro Gly 
Val Glu Val Phe Glu Phe Pro Ser Glu Tyr Asp Met Leu Ala Ala Phe 

45 Leu Ser Met Leu Arg Asp Tyr Asn Val Glu Phe He Thr Gly Tyr Asn 
He Ala Asn Phe Asp Leu Pro Tyr He He Ala Arg Ala Thr Gin Val 
Tyr Asp Phe Lys Leu Gin Asp Phe Thr Lys He Lys Thr Gly Ser Val 
Phe Glu Val His Gin Pro Arg Gly Gly Ser Asp Gly Gly Asn Phe Met 
Arg Ser Gin Ser Lys Val Lys He Ser Gly He Val Pro He Asp Met 

50 Tyr Gin Val Cys Arg Glu Lys Leu Ser Leu Ser Asp Tyr Lys Leu Asp 
Thr Val Ala Lys Gin Cys Leu Gly Arg Gin Lys Asp Asp He Ser Tyr 
Lys Asp He Pro Pro Leu Phe Lys Ser Gly Pro Asp Gly Arg Ala Lys 
Val Gly Asn Tyr Cys Val He Asp Ser Val Leu Val Met Asp Leu Leu 
Leu Arg Phe Gin Thr His Val Glu He Ser Glu He Ala Lys Leu Ala 

55 Lys He Pro Thr Arg Arg Val Leu Thr Asp Gly Gin Gin He Arg Val 
Phe Ser Cys Leu Leu Glu Ala Ala Ala Thr Glu Gly Tyr lie Leu Pro 
Val Pro Lys Gly Asp Ala Val Ser Gly Tyr Gin Gly Ala Thr Val He 
Ser Pro Ser Pro Gly Phe Tyr Asp Asp Pro Val Leu Val Val Asp Phe 
Ala Ser Leu Tyr Pro Ser He He Gin Ala His Asn Leu Cys Tyr Ser 

60 Thr Leu He Pro Gly Asp Ser Leu His Leu His Pro His Leu Ser Pro 
Asp Asp Tyr Glu Thr Phe Val Leu Ser Gly Gly Pro Val His Phe Val 
Lys Lys His Lys Arg Glu Ser Leu Leu Ala Lys Leu Leu Thr Val Trp 
Leu Ala Lys Arg Lys Glu He Arg Lys Thr Leu Ala Ser Cys Thr Asp 
Pro Ala Leu Lys Thr He Leu Asp Lys Gin Gin Leu Ala He Lys Val 
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Thr Cys Asn Ala Val Tyr Gly Phe Thr Gly Val Ala Ser Gly He Leu 
Pro Cys Leu Asn He Ala Glu Thr Val Thr Leu Gin Gly Arg Lys Met 
Leu Glu Arg Ser Gin Ala Phe Val Glu Ala He Ser Pro Glu Arg Leu 
Ala Gly Leu Leu Arg Arg Pro Val Asp Val Ser Pro Asp Ala Arg Phe 
5 Lys Val He Tyr Gly Asp Thr Asp Ser Leu Phe He Cys Cys Met Gly 
Phe Asn Met Asp Ser Val Ser Asp Phe Ala Glu Glu Leu Ala Ser He 
Thr Thr Asn Thr Leu Phe Arg Ser Pro He Lys Leu Glu Ala Glu Lys 
He Phe Lys Cys Leu Leu Leu Leu Thr Lys Lys Arg Tyr Val Gly Val 
Leu Ser Asp Asp Lys Val Leu Met Lys Gly Val Asp Leu He Arg Lys 

10 Thr Ala Cys Arg Phe Val Gin Glu Lys Ser Ser Gin Val Leu Asp Leu 
He Leu Arg Glu Pro Ser Val Lys Ala Ala Ala Lys Leu He Ser Gly 
Gin Ala Thr Asp Trp Val Tyr Arg Glu Gly Leu Pro Glu Gly Phe Val 
Lys He lie Gin Val Leu Asn Ala Ser His Arg Glu Leu Cys Glu Arg 
Ser Val Pro Val Asp Lys Leu Thr Phe Thr Thr Glu Leu Ser Arg Pro 

15 Leu Ala Asp Tyr Lys Thr Gin Asn Leu Pro His Leu Thr Val Tyr Gin 
Lys Leu Gin Ala Arg Gin Glu Glu Leu Pro Gin He His Asp Arg He 
Pro Tyr Val Phe Val Asp Ala Pro Gly Ser Leu Arg Ser Glu Leu Ala 
Glu His Pro Glu Tyr Val Lys Gin His Gly Leu Arg Val Ala Val Asp 
Leu Tyr Phe Asp Lys 

20 

SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

C CTA TGT TAC TCT ACC CTG ATT CAG GGG AAC GCC ATT CTC TCG CAC 46 

Leu Cys Tyr Ser Thr Leu He Gin Gly Asn Ala He Leu Ser His 
CCC GAG TTG ACC CCG AAC GAC TAC GAA ACA TTC CAC CTA AGC GGA GGA 94 

25 Pro Glu Leu Thr Pro Asn Asp Tyr Glu Thr Phe His Leu Ser Gly Gly 

CCG GTG CAC TTC GTA AAA AAA CAC GTA CGA GAG TCA TTA CTG TCA AAA 142 
Pro Val His Phe Val Lys Lys His Val Arg Glu Ser Leu Leu Ser Lys 
CTT CTG ACG ACT TGG CTA ACA AAA AGA AAA GAG ATC CGC AAA AAT CTC 190 
Leu Leu Thr Thr Trp Leu Thr Lys Arg Lys Glu He Arg Lys Asn Leu 

30 GCC TCG TGC GGA GAC CCA ACC ATG CGA ACC ATC CTT GAT AAG CAG CAG 238 
Ala Ser Cys Gly Asp Pro Thr Met Arg Thr He Leu Asp Lys Gin Gin 
CTG GCC ATC AAG GTC ACA TGT AAT GCG GTG TAC GGG TTT ACC GGC GTC 286 
Leu Ala He Lys Val Thr Cys Asn Ala Val Tyr Gly Phe Thr Gly Val 
GCC TCC GGT ATT CTA CCG TGC CTG AAT ATT GCA GAA ACA GTC ACC CTC 334 

35 Ala Ser Gly He Leu Pro Cys Leu Asn lie Ala Glu Thr Val Thr Leu 

CAG GGC AGA AAA ATG CTA GAA ACG TCC CAG GCG TTT GTA GAG GGC ATA 382 
Gin Gly Arg Lys Met Leu Glu Thr Ser Gin Ala Phe Val Glu Gly lie 
TCG CCA AAA GAC CTG TCA GAC CTG ATA CAA CGT CCG ATC GAC GCT TCC 430 
Ser Pro Lys Asp Leu Ser Asp Leu He Gin Arg Pro He Asp Ala Ser 

40 CCG GAC GCC AGG TTT AAA GTG ATA 454 
Pro Asp Ala Arg Phe Lys Val He 

SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
Leu Cys Tyr Ser Thr Leu He Gin Gly Asn Ala He Leu Ser His Pro 

45 Glu Leu Thr Pro Asn Asp Tyr Glu Thr Phe His Leu Ser Gly Gly Pro 
Val His Phe Val Lys Lys His Val Arg Glu Ser Leu Leu Ser Lys Leu 
Leu Thr Thr Trp Leu Thr Lys Arg Lys Glu He Arg Lys Asn Leu Ala 
Ser Cys Gly Asp Pro Thr Met Arg Thr He Leu Asp Lys Gin Gin Leu 
Ala He Lys Val Thr Cys Asn Ala Val Tyr Gly Phe Thr Gly Val Ala 

50 Ser Gly He Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr Leu Gin 
Gly Arg Lys Met Leu Glu Thr Ser Gin Ala Phe Val Glu Gly He Ser 
Pro Lys Asp Leu Ser Asp Leu He Gin Arg Pro He Asp Ala Ser Pro 
Asp Ala Arg Phe Lys Val He 

55 SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Val Ala Ser Gly He Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr 
Leu Gin Gly Arg Lys Met Leu Glu Arg Ser Gin Ala Phe Val Glu Ala 
He Ser Pro Glu Arg Leu Ala Gly Leu Leu Arg Arg Pro Val Asp Val 
Ser Pro Asp Ala Arg Phe Arg Val He 



60 



SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Val Ala Ser Gly He Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr 
Leu Gin Gly Arg Lys Met Leu Glu Arg Ser Gin Ala Phe Val Glu Ala 
He Ser Pro Glu Arg Leu Ala Gly Leu Leu Arg Arg Pro He Asp Val 
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Ser Pro Asp Ala Arg Phe Lys Val He 

SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Val Ala Ser Gly lie Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr 
5 Leu Gin Gly Arg Lys Met Leu Glu Arg Ser Gin Ala Phe Val Glu Ala 

He Ser Pro Glu Arg Leu Ala Gly Leu Leu Arg Arg Pro Val Asp Val 
Ser Pro Asp Ala Arg Phe Lys Val He 

SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
10 Val Ala Ser Gly He Leu Pro Cys Leu Asn He Ala Glu Thr Val Thr 

Leu Gin Gly Arg Lys Met Leu Glu Arg Ser Gin Ala Phe Val Glu Ala 
He Ser Pro Glu Arg Leu Ala Gly Leu Leu Arg Arg Pro Val Asp Val 
Ser Pro Asp Ala Arg Phe Arg Val He 

15 SEQUENCE DESCRIPTION: SEQ ID N0:124: 

AACACAGAGT CNGTRTCNCC RTA 23 



20 



35 



50 



SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

AGCATCATCA TGGCCCAYAA YCTNTGYT 28 
SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

GAYTTYGCNA GYYTNTAYCC 20 



SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
25 CACCCATRCA YTCDATRTCR AA 22 

SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

TACAACGTCC TCTCCTTYGA YATHGARTG 29 

30 SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

GTCTGCGTGA AYGTNTTYGG NCA 23 



SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

GACGACCGCA GCGTGTGCGT GAAYGTNTTY GGNCA 35 
SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

ACGACCGCAG CGTGTGCGTG 20 



SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
40 TAAAAGTACA GCTCCTGCCC GAANACRTTN ACRCA 35 

SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TAAAAGTACA GCTCCTGCCC GAA 23 

45 SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

TTTGACTTTG CCAGCCTGTA YCCNAGYATN AT 32 



SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

TTTGACTTTG CCAGCCTGTA YCCNTCNATN AT 32 
SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

TTTGACTTTG CCAGCCTGTA 20 



SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
55 CGGCATGCGA CAAACACGGA GTCCGTRTCN CCRTADAT 38 

SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

TTAGCTACTC CGTGGAGCAG YTTRTCRAAR TA 32 

60 SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

TTGTGCGCTT GGATGATACT G 21 

SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

GAGGGCCTGC TGGAGGACGT G 21 
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SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

CGGTGGAGAA GCCGCAGGAT G 21 

5 SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

ACCTCCCGCA CCTGACCGTG T 21 

SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

AAGCTAGACA GGAGGAGCTT C 21 

10 

SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

ACTTGAATTA TCTTGACGAA C 21 

SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
15 ACGACAAGGT TCTGATGAAG G 21 

SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

AGAGACTCTT GGACGGAACT G 21 

20 SEQUENCE DESCRIPTION: SEQ ID N0:147: 

AGTTTGACTG CAGCTGGGAG G 21 

SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

CGGGTATCAG TGTGGAGTAG C 21 



25 



SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
GAGGACAAAG GTTTCGTAGT C 



21 



30 



SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
CTATGTTACT CTACCCTGAT T 



21 



SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
GTATATCTCT TTAAACCTGG C 



21 



35 



SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
AACCTGGCGT CCGGGGAAGC G 



21 
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Claims 



We claim: 



5 1 . An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 

polynucleotide comprising a sequence of nucleotides at least 69% identical to nucleotides 27 to 
501 of a sequence selected from the group consisting of SEQ. ID NO:1 and SEQ. ID NO:3. 

2. An isolated polynucleotide comprising a fragment of at least 18 consecutive nucleotides of the 
10 DNA polymerase encoding region of the polynucleotide of claim 1, wherein the sequence of 

said fragment is not contained in SEQ. ID NOS:1 1 0 or 1 1 1 . 

3. An isolated polynucleotide comprising a fragment of at least 50 consecutive nucleotides of the 
DNA polymerase encoding region of the polynucleotide of claim 1. 

15 

4. The isolated polynucleotide of claim 2, which encodes a polypeptide with nucleic acid binding 
activity, nucleotide binding activity, or DNA polymerase activity. 

5. An isolated polynucleotide with a region encoding a DNA polymerase of a herpes virus, the 
20 polynucleotide comprising a sequence of 26 nucleotides at least 80% identical to 

oligonucleotide LSGGA (SEQ. ID NO:107), or a sequence of 29 nucleotides at least 69% 
identical to oligonucleotide CTDPA (SEQ. ID NO: 108) or a sequence of 32 nucleotides at least 
80% identical to oligonucleotide KM LEA (SEQ. ID NO:22), or a sequence of 29 nucleotides at 
least 69% identical to oligonucleotide GISPA (SEQ. ID NO: 109). 

25 

6. An isolated polynucleotide comprising a fragment of at least 18 consecutive nucleotides of the 
DNA polymerase encoding region of the polynucleotide of claim 5, wherein the sequence of 
said fragment is not contained in SEQ. ID NOS:1 1 0 or 1 1 1 . 

30 7. The polynucleotide of claim 1 or claim 2, wherein said herpes virus is capable of infecting 

primates. 

8. The polynucleotide of claim 1 or claim 2, wherein said herpes virus is RFHV, RFHV2, or KSHV. 

35 9. An isolated polynucleotide comprising a linear sequence of at least 18 nucleotides identical to 

a linear sequence between nucleotides 27 to 501 inclusive of SEQ. ID NO:1, or between 
nucleotides 27 to 501 inclusive of SEQ. ID NO:3, or between nucleotides 36 to 2499 inclusive 
of SEQ. ID NO:116, or between nucleotides 1 to 454 inclusive of SEQ. ID NO:118, but not to a 
linear sequence within either SEQ. ID NO:110or SEQ. IDNO:111. 

40 
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10. The isolated polynucleotide of claim 9, comprising a linear sequence essentially identical to 
nucleotides 27 to 501 of SEQ. ID NO:1, or to nucleotides 27 to 501 of SEQ. ID NO:3, or to 
nucleotides 36 to 2499 of SEQ. ID NO:1 16, or to nucleotides 1 to 454 of SEQ. ID NO:1 18. 

5 1 1 . An isolated polypeptide encoded by the polynucleotide of claim 3. 

12. An isolated polypeptide, comprising a linear sequence of at least 12 amino acids essentially 
identical to a sequence between amino acids 10 to 167 inclusive of SEQ. ID NO:2 or between 
amino acids 10 to 167 inclusive of SEQ. ID NO:4 or between amino acids 13 to 833 inclusive of 

10 SEQ. ID NO: 117, or in any of SEQ. ID NOS:119-123, but which is not contained in SEQ. ID 

NOS:1 12 or in SEQ. ID NO:1 13. 

13. The isolated polypeptide of claim 12, which has nucleic acid binding activity, nucleotide binding 
activity, or DNA polymerase activity. 

15 

14. A fusion polypeptide comprising the amino acid sequence of an isolated peptide of claim 12, 
joined to a second amino acid sequence. 

15. An isolated polypeptide, comprising a linear sequence of amino acids identical to a sequence 
20 selected from the group consisting of SEQ. ID NOS:80, 82, 84, 86, 88, and 90 to 103. 

16. An isolated or non-naturally occurring polynucleotide encoding the polypeptide of claim 12. 

17. A polynucleotide encoding a fusion polypeptide, comprising the polynucleotide of claim 2 joined 
25 directly to a second polynucleotide encoding a polypeptide. 

18. A recombinant cloning vector comprising a polynucleotide sequence encoding a polypeptide of 
at least 12 consecutive amino acids identical to that of a polypeptide according to claim 12. 

30 19. A recombinant expression vector comprising a polynucleotide sequence encoding a 

polypeptide of at least 12 consecutive amino acids identical to that of a polypeptide according 
to claim 12. 

20. A recombinant cloning vector comprising a linear sequence encoding a DNA polymerase of a 
35 herpes virus, the sequence being at least 18 nucleotides long and identical to a linear 

sequence within SEQ. ID NOS:1, 3, 116, or 118, but not in SEQ. ID NOS:110 or 111. 

21. A host cell genetically altered by the polynucleotide of claim 16, or by the vector of claim 18, 
claim 19, or claim 20. 

40 
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22. A monoclonal or isolated polyclonal antibody specific for a DNA polymerase encoded in said 
encoding region of the polynucleotide of claim 1 . 

23. A monoclonal or isolated polyclonal antibody specific for the polypeptide of claim 12. 

5 

24. The antibody of claim 22, which is a monoclonal antibody. 

25. The antibody of claim 22, which is an isolated polyclonal antibody. 

10 26. An oligonucleotide essentially identical to an oligonucleotide selected from the group consisting 

of SEQ. ID NOS:5to 16, 21, 22, 104-109, and 124-152. 

27. A method of obtaining an amplified copy of a polynucleotide encoding a DNA polymerase, 
comprising the steps of: 

1 5 a) contacting the polynucleotide with the oligonucleotide of claim 26; and 

b) elongating oligonucleotide that has formed a duplex with the polynucleotide. 

28. The method of claim 27, comprising performing a polymerase chain reaction (PCR). 

20 29. The method of claim 28, wherein said PCR comprises repeated cycles of annealing and 

elongating, wherein the annealing is conducted at a temperature of at least 60°C, 

30. The method of claim 28, wherein said PCR is conducted in a buffer containing 10-30 mM 
(NH4) 2 S0 4 and 1-10 mM MgCI 2 . 

25 

31. The method of claim 27, wherein the polynucleotide which is amplified is first obtained from a 
biological sample taken from an individual affected with a disease featuring fibroblast 
proliferation and collagen deposition. 

30 32. A method of detecting viral DNA or RNA in a sample of primate origin, comprising the steps of: 

a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 
claim 2 under conditions that would permit the probe to form a stable duplex with a 
polynucleotide having the sequence shown in SEQ. ID NO:1, and with a polynucleotide having 
the sequence shown in SEQ. ID NO:3, but not with a polynucleotide having a sequence of any 

35 of SEQ. ID NOS:24 to 29; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

33. The method of claim 32 further comprising conducting an amplification reaction on the DNA or 
RNA of the sample prior to being contacted with the probe. 

40 
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34. The method of claim 33, wherein the amplification reaction is conducted using an 
oligonucleotide primer comprising a sequence according to claim 26. 

35. A method of detecting viral DNA or RNA in a sample of primate origin, comprising the steps of: 

a) contacting the DNA or RNA in the sample with an oligonucleotide probe comprising a 
sequence shown in SEQ. ID NOS: 21, 22, 107, 108, or 109, under conditions that would permit 
the probe to form a stable duplex with a polynucleotide having the sequence shown in SEQ. ID 
NO:1, and with a polynucleotide having the sequence shown in SEQ. ID NO:3, but not with a 
polynucleotide having a sequence of any of SEQ. ID NOS:24 to 29; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

36. A method of detecting viral DNA or RNA in a sample, comprising the steps of: 

a) contacting the DNA or RNA in the sample with an oligonucleotide probe comprising a 
sequence shown in SEQ. ID NOS:22, 107, 108 or 109 under conditions that would permit the 
probe to form a stable duplex with a polynucleotide having the sequence shown in SEQ. ID 
NO:1, and with a polynucleotide having the sequence shown in SEQ. ID NO:3, but not with a 
polynucleotide having a sequence of any of SEQ. ID NOS:23 to 29; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

37. A method of detecting viral DNA or RNA in a sample, comprising the steps of: 

a) conducting an amplification reaction on a polynucleotide in the sample using the 
oligonucleotide of claim 26 as a primer in the reaction; and 

b) detecting the presence of amplified copies of the polynucleotide, if any. 

38. An isolated polynucleotide capable of forming a stable duplex with an oligonucleotide 
comprising a sequence selected from the group consisting of SEQ. ID NO:107, SEQ. ID 
NO:108, and their respective complementary sequences, under conditions wherein the 
oligonucleotide is capable of forming a stable duplex with a polynucleotide having the 
sequence shown in SEQ, ID NO:1, and with a polynucleotide having the sequence shown in 
SEQ. ID NO:3, but not with a polynucleotide having a sequence of any of SEQ. ID NOS:23 to 
29. 

39. An isolated polypeptide comprising a linear sequence of 12 amino acids encoded within the 
polynucleotide of claim 38. 

40. A method for detecting infection of an individual by a herpes virus, comprising detecting viral 
DNA or RNA in a biological sample obtained from the individual, wherein the detecting of viral 
DNA or RNA is by the method of claim 32 
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41. A method for detecting infection of an individual by a herpes virus, comprising detecting viral 
DNA or RNA in a biological sample obtained from the individual, wherein the detecting of viral 
DNA or RNA is by the method of: 

a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 
claim 2 under conditions that would permit the probe to form a stable duplex with a 
polynucleotide having at least one sequence selected from the group consisting of SEQ. ID 
NOS:1, 3, 116, or 118, but not with polynucleotides having a sequence of any of SEQ. ID 
NOS:24 to 29; and 

b) detecting the presence of said stable duplex formed in step a), if any. 



42. A method for detecting infection of an individual by a herpes virus, comprising detecting viral 
DNA or RNA in a biological sample obtained from the individual, wherein the detecting of viral 
DNA or RNA is by the method of: 

a) contacting the DNA or RNA in the sample with a probe comprising the polynucleotide of 
15 claim 2 under conditions that would permit the probe to form a stable duplex with a 

polynucleotide having a sequence shown in SEQ. ID NO:116, but not with polynucleotides 
having a sequence of any of SEQ. ID NOS:24 to 29; and 

b) detecting the presence of said stable duplex formed in step a), if any. 

20 43. A diagnostic kit for detecting a herpes virus polynucleotide in a biological sample, comprising a 

reagent in suitable packaging, wherein the reagent comprises the polynucleotide of claim 2. 

44. A diagnostic kit for detecting a herpes virus polynucleotide in a biological sample, comprising a 
reagent in suitable packaging, wherein the reagent comprises the oligonucleotide of claim 26. 

25 

45. A method of detecting infection of an individual by a herpes virus, comprising the steps of: 

a) contacting antibody from a sample obtained from the individual with the polypeptide of 
claim 11 or claim 12 under conditions that permit the formation of a stable antigen-antibody 
complex; and 

30 b) detecting said stable complexes formed in step a), if any. 

46. A diagnostic kit for detecting an anti-herpesvirus antibody present in a biological sample, 
comprising a reagent in suitable packaging, wherein the reagent comprises the polypeptide of 
claim 11 or claim 12. 

35 

47. A method of detecting infection of an individual by a herpes virus, comprising the steps of: 

a) contacting a polypeptide from a sample obtained from the individual with the antibody of 
claim 22 or claim 23 under conditions that permit the formation of a stable antigen-antibody 
complex; and 

40 b) detecting said stable complexes formed in step a), if any. 
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48. A diagnostic kit for detecting a herpes virus polypeptide present in a biological sample, 
comprising a reagent in suitable packaging, wherein the reagent comprises the antibody of 
claim 22 or claim 23. 

5 

49. A composition for use in the treatment of herpes virus infection, comprising the polynucleotide 
of claim 2 and a compatible pharmaceutical excipient. 

50. A method of determining whether a pharmaceutical candidate is useful for treating gamma 
1 0 herpes infection, comprising the steps of: 

a) contacting the polypeptide of claim 1 1 with the pharmaceutical candidate; and 

b) determining whether a biochemical function of the polypeptide is altered by the 
pharmaceutical candidate. 

15 51. The method of claim 50, wherein the biochemical function of the polypeptide determined in 

step b) is the binding of the polypeptide to a nucleic acid. 

52. The method of claim 50, wherein the biochemical function of the polypeptide determined in 
step b) is DNA polymerase activity. 

20 

53. A method of determining whether a pharmaceutical candidate is useful for treating gamma 
herpes infection, comprising the steps of: 

a) genetically altering a cell with the polynucleotide of claim 2; and 

b) determining the effect of the pharmaceutical candidate on the cell in comparison with a cell 
25 not genetically altered with the polynucleotide. 

54. A method of obtaining a compound for use in treating an individual infected with herpes virus, 
comprising the steps of: 

a) creating a compound capable of binding a region of the polypeptide of claim 1 1 or claim 
30 12 involved in interacting with a nucleic acid; and 

b) determining whether the compound interferes with a biochemical function of the 
polypeptide. 
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KSHV GTGTTC6ACTTTGCTAGCCTCTACCCCAGTATCATCCAAGCGCACAACTTGTGCTACTCC 60 

VFDFASLYPSI IQAHNLCYS 
(DFASA>) gtgttcgacttygcnagyytntaycc 



KSHV ACACTGATACCCGGCGATTCGCTCCACCTGCACCCACACCTCTCCCCGGACGACTACGAA 120 
TLIPGDSLHLHPHLSPDDYE 



KSHV ACCTTTGTCCTCAGCGGAGGTCCGGTCCACT1TGTAAAAAAACACAAAAGGGAGTCCCTT 180 
TFVLSGGPVHFVKKHKRESL 



KSHV CTTACCAAGCTTCTGACGGTATGGCTCGCGAAGAGAAAAGAAATAAGAAAGACCCTGGCA 240 
LTK L LTVWLAKRKE I RKTLA 



KSHV TCATGCACGGACCCCGCACTGAAAACTATTCTAGACAAACAACAACTGGCCATCAAGGTT 300 
SCTDPALKTILDKQQLAIKV 



( PCLNA>) gtcgcctctggcatcctnccntgyctnaa 
(ILPCA>) ggcatcctaccgtgcctgaac 
(VASGA>) cgtcgcttccggcatcctacc 
TCNAVYGFTGVASGI LPCLN 
RFHV ACGTGCAACGCGGTGTACGGGTTTACGG GCGTCGCTTCCGGCATCCTACCfiTfirrTCAAr 360 

** AAAAAAAAAAA ***** ** AAAAAAAA ** ** ***** ** ** ***** *** 

KSHV ACCTGCAACGCGGTTTACGGCTTCACGG GCGTTGCCTCTGGCATACTGCCTTGCCTAAAr 360 
TCNAVYGFTGVASGI LPCLN 
(VYGA>) acgtgcaacgcggtgtayggnktnacngg (CLNIA>) ctgccttgcctaaac- 

(SGILA>) gcgttgcctctggcatactg 



(GISPA>) tctcaggcgttcgta - 
(KMLEA>) cagggccggaagatgctggaracrtcncargc 
IAETVTLQGRKMLEISQAFV 
RFHV ATCGCAGAGACGGTGACCCTCCAfififi CAGGAAAATGCTGGAAACGTCTCAGGCfiTTr.fSTA 420 

** ** AAAAA ***** ****** * ** AAAAAAAA * AAAAAAAA ** *** 

KSHV ATAGCGGAGACCGTGACACTAnAAfiK GCGAAAGATGCTGGAGAGATCTCAGGCCTTTfiTA 420 

IAETVTLQGRKMLERSQAFV 
-atagcg (CLNIA>) 
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(<PEARB) ccgcagaggc- 

•garggnathtcncc (GISPA>) (<PIEAB) ctggctagctccgcagaggc- 

ESISPIALA.DLLQRPI£ASP 
RFHV QAGGQ MTCTCGCCMC6GCACTGfiCAGACCTACT6CAfiCfiACCGATCfiAfifirRTrTr:r:R 480 

**** AAAAAAAA ***** ** **** ********* **** 

KSHV GA(^CC ATCTCGCCGGMC6CCTARCG(^TCTCCTfir.fi^fifiCCAATARAr.RTr.Tr.Ar.rr 480 
EAISPEBLASLLBRPIDYSP 

(<IEASB) tccggttatctgcagagtgg 
(<EARFB) gcagagtggg- 



-cttcggtccaa (<PEARB) 
-c (<PIEAB) 

EARFKVIYGDTDSVFVAC 
RFHV GMGCCAGGTTTAAARTCATATAfQyRArArrRArTrrCTCTTTqjrqr^jqrrq 536 

** *** ******* AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

KSHV GACGCCCt^TTrAAf^TrATATArfiiyRArArrfiArTrrftTCTjTqT^y^TyrR 536 
DARFKVIYGDTDSVFVAC 

(<GD7D1B) atrccnctrtgnctgaggcacaaacagcgtacggc 
-ctgcgggctaa (<EARFB) 



v 
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hHV6 V MDSV-SFFNPYLEA NRLKK 

hHSVl MFSGG GGPLSPGGKSAARAASGFFAPAGPRGAGR-GPPPCLRQNFYNPYLAPVGTQQ 

hHSV2 MFCAA GGPASPGGKSAARAASGFFAPHNPRGATQTAPPPCRRQNFYNPHLAQTGTQP 

hVZV M AIR TGFCNPFLTQASGI KYNPRTGRGSN — 

eHVl M AAREQANSVRR — SGFFNPFIGKRP FFRPGSGQTAETE 

hEBV M SGGL — FYNPFL RPNKGLLKKPD — 

sHVl M D FYNPYL SKKPTDTKTPKLH 

mCMV M DTCVETFFNPYL RRK PRRDWRR CED 

gpCMV M SAPV — FFNPYLCGGAARRRNG-CSTVDSRR VNGPTKK 

hCMV M FFNPYLSGGVTGGAVA-GGRRQRSQPGSAQGSGKR 

iHVl MDRNAVLYGVLEHRLPKWVELSDDTDLEPFFFSSVRYITAGS EDAIMIQA 



hHV6 KSRS SYIRILPRGIMHDGAA GL I KDVC D S E PRMFYRDRQ YL L S K 

hHSVl KPTGPTQRHTYYSECDEFRFIAPRVLDEDAPPEKRAGVHDGHLKRAPKVYCGGDERDV-L 

hHSV2 KAPGPAQRHTYYSECDEFRFIAPRSLDEDAPAEQRTGVHDGRLRRAPKVYCGGDERDV-L 

hVZV — REF--LHSYKTTMSSFQFLAPKCLDEDVPMEERKGVHVGTLSRPPKVYCNGKEVPI-L 

eHVl RPRPP — QHSYCTEVGSFKFIAPRCLDEEAPADQRRGVHVGTLERPPKVYCDGSEYDV-L 



hEBV — KE YL-RLIPKCFQTPGAA GWDVRGPQPPLCFYQDSLTWGG 

sHVl TTRQ SICRLVPKCFRNPTEK GWSVS S FALPTYFFKGNENKVYL 

mCMV N-NK NFLQWPRGVLYDGAT GLIKVQSGMEPRMFYAEKEYVLNP 

gpCMV G-KK S FLQ WRRGVI YDGEK GLIKKVTQHPPRMFYNNVQYLLEP 

hCMV PPQK QFLQ I VPRGVMFDGQT GL I KHKTGRLPLMFYRE I KHLL SH 



iHV 1 LNLNT DE IWFLVTNLNFMAL I PTVY I ENPG IRQL IAS TPISYRSPI TVFNGD 



hHV6 EMTWPSL — DIARSKDY DHM-RMK-FHIYDAVETL — MFTDS IENLPFQYRHFV 

hHSVl RVG S GG FWPRRS RLWGGVDHAP AGFNPTVT VFHVYD I LENVEHAYGMRAAQFHARFMDA I 

hHSV2 RVGPEGFWPRRLRLWGGADHAPEGFDPTVTVFHVYDILEHVEHAYSMRAAQLHERFMDAI 

hVZV DFRCS S PWPRRVNI WGE I DFRGDKFDPRFNT FHVYDI VETTEAA SNGDVS RFAT AT 

eHVl NFASGGCWPRR I RVWNGQDFRGDGFNPRFERFHVYD I VET S ES A SHDDPSRFAELS 

hEBV DEDGKGMWWRQRAQEGTARPEADTHGSPLD-FHVYDILETV — YTHEKCAVIPSDKQGYV 

sHVl -ENGKSMWHLRRPCKNALLEEQ SIT-FHIYDIVETT — YSEDRCNDI PFKFQTD I 

mCMV DKPWP TLRTRGWCRGPYSDD VR-FHTYDQWNL — VLADSDEQISPRWNSHV 

gpCMV QMSWP TLPCRETCRVGCGREQ-PLR-FHTFDQIDST — VYADSVEQ I FLGYRRHV 

hCMV DMVWP CPWRET LVGRV VG-PIR-FHTYDQTDAV — LFFDSPENVSPRYRQHL 

iHVl LKKWMDCDLFVFGTMAAQKAF IKAGNSVLGSLGGNVYTYGDHV 



hHV6 IPSGTVIRMFGR-TEDGE K I CVNVFGQEQ YFY CECVDGRSLKAT INN 

hHSVl TPTGTVITLLGL-TPEGH RVAVHVYGTRQYFYMNKEEVDRHLQCRAPRDLCERMAA 

hHSV2 TPAGTVITLLGL-TPEGH RVAVHVYGTRQYFYMNKAEVDRHLQCRAPRDLCERLAA 

hVZV RPLGTVITLLGM- SRCGK RVAVHVYG I C Q YFY INKAEVDT ACG I RS G S EL S VLLAE 

eHVl RPSGSWTLLGM-SECGK RVAVHVYGVRHYFYMAKAEVDSACGITTEAELVRAMVD 

hEBV VPCGIVIKLLGRRKADGA SVCVNVFGQQAYFY ASAPQGLDVEFAVLS 

sHVl I PNGTVLKLLGR-TLEGA SVCVNVFGQRNYFY VKVPEGGNITYLIKQ 

mCMV VPAGNVI RMFGA-TDEGV SVCVNVFGQKAYFY CERMQSEDLKNTVYD 

gpCMV VPCGNVIRMFGR-TCDGS SVCVNVFGQPSYFY CEYDGSEGYLDNYLS 

hCMV VP S GNVLRFFGA-T EHGY S I CVNVFGQRS YFY CEYSDTDRLREVIAS 

iHVl SNFDGNTPVLQNNLMCSHVYYTRYKTDVYAPWEFYYDQKRDQGYL MSLPAIIPR 
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Figure 2 (2) 

hHV6 LMLTGE VK MS C S FV I E PADKL S L YG YNANT WN 

hHSVl ALRESP GASF RGISADHFEAEWERTDVYYYETRPAL- 

hHSV2 ALRESP GASF RGISADHFEAEWERADVYYYETRPTL- 

hVZV CLRSSM- ITQNDATLNGDKNAFHGTSF KSASPESFRVEVIERTDVYYYDTQPCA- 

eHVl CAHSSA-LSAALGNGNGGKQSGGSGGGWWGG^KHVSADCFKVETVCHTTLYYFGSKPAL- 

hEBV -ALKAS -TFDRR TPCRVS VEKVTRRS IMGYGNHA-GD 

sHVl -ALNEK — FS PSCAYQTEAVKKKILSRYDPEE-HD 

mCMV IADKVP-EPCSP FSVS I SPVTKS SFYGYGLGHI PN 

gpCMV T VLKET - EDVTK IVFTLDAQRVHKYSLFGYNTKYIEN 

hCMV VGELVP-EPRTP YAVSVTPATKTS IYGYGTRPVPD 

iHVl CKREGAFDIETIVHENAMDQDLNCQKFFKSEFRSMEESQVLIQRFREAGVTGLPPSPFVG 



hHV6 LFKVSFGNFWSQRIGKILQN-EGFVVYEIDVDVLTRFFVDN-GFLSFGVmJVKKYIPQD 

hHSVl FYRVYVRSGRVLSYLCDNFCP — AIKKYEGGVDATTRFILDNPGFVTFGWYRLKPGRNNT 

hHSV2 YYRVFVRSGRALAYLCDNFCP — A I RKYEGGVDATTRF I LDNPGFVT FGWYRLKPGRGNA 

hVZV FYRVYS P S S KFTNYLCDNFHP — ELKKYEGRVDATTRFLMDNPGFVSFGWYQLKPGVDGE 

eHVl YYRVSASSSRLGGFICDNFHP — EITKFEGSVDVTTRLLLDNENFTSFGWYRLRPGTHGE 

hEBV YHKITLSHPNSVCHVATWLQDKHGCRIFEANVDATRRFVLDN-DFVTFGWYSCRRAIPRL 

sHVl VFKVTVSSSLSVYKISDSLVS-NGCEVFETNVDAIRRFVIDN-DFSTFGWYTCKSACPRI 

mCMV LYRLSFNNWNMCRKIGKRMLE-EGRKVYELGVDPLARFLIDR-KI^ 

gp CMV L YRVTLNNWPVC KRLAQNL Q S - RGLRVYEAGVDPVARFC VDR- KIPS FGWC V I KRFYARS 

hCMV LQCVSISNWTMARKIGEYLLE-QGFPVYEVRVDPLTRLVIDR-RITTFGWCSVNRYDWRQ 

iHVl ITQKLHEIVSISLWCNYHKTGPKKKEY YVYYNTKK 



hHV 6 MGK GSNLEVE INCHVS DLVS L - EDVNWPLYfcCWSFiDIECLGbNGN FPDAE 

hHSVl LAQPRAPMAFGTSSDVEFNCTADNLAIEGGMSDLPAY KLMCFIMECKA 3GEDELAFPVAG 

hHSV2 PAQ P RP PT AFGT S S DVE FNCTADNLAVEGAMCDL PAY KLMCFfcXECKA 3GEDELAFPVAE 

hVZV RVRVRPAS RQLT L S DVE I D CMS DNLQAI PNDDS WPDY KLLCFEIECKS 3GSNELAFPDAT 

eHVl RVQLRPVERHVTS SDVE INCTPDNLEP I PDEAAWPDY KIMGFJDIECKA 3TGNEMAFPVAT 

hEBV QHR DSYAELEYDCEVGDLSVRREDSSWPSY QAI*AFI>IECLG SEG FPTAT 

sHVl TNR DSHTDIEFDCGYYDLEFHADRTEWPPY STIM5FDXBCIG 2KG FPCAK 

mCMV AGY VSRAQLEIDCDVADILPIEEQSNWPFY RCLSFDIECMS STGA FPAAE 

gp CMV SGL AS FCD I E I DCE I GDVEADDSDMSWPEY RCASFDIEGMS SGDR FPDS S 

hCMV QGR AST CDIEVDCDVSDLVAVPDDSSWPRY RCLSFDIECMS 3EGG FPCAE 

iHVl MENPMEMIPVEHLHLDASRIKFEACKNE FY M&S&F-lNRLR <SVNVL-YVYNA 



hHV6 NLGD I VI Q I SVI S FDTEG DRDERHLFTLGTCEKID- 

hHSVl HPEDLVIQISCLLYDLST-TALEHVLLFSLG-SCDLPESHLNELAARGL P 

hHSV2 RPEDLVIQISCLLYDLST-TALEHILLFSLG-SCDLPESHLSDLASRGL P 

hVZV HLEDLVIQISCLLYSIPR-QSLEHILLFSLG-SCDLPQRYVQEMKDAGL P 

eHVl NQEDLVI Q I S CLLYSLAT - QNHEHTLLFSLG- S CD I SEE YS FACVQRGE P 

hEBV NEADLILQISCVLWSTGEEAGRYRRI LLTLGTCEDIE- 

sHVl NEGDL 1 1 Q I SCVFWHAGALDTT -RNM LLSLGTCSAVE- 

mCMV NVDDIIIQISCVCF G VG EMVHHA YD VHAD L S T P A VP EN HLFTIGPCAPI-P 

qpCMV MVDDIVIQISVICY AVGRSGAE S DGVSG — AEAAVREHQ — HLFTLGPCAPI-P 

hCMV KSDDIVIQISCVCYETGGNTAVDQGIPNGNDGRGCTSEGVIFGHSGLHLFTIGTCGQVGP 

iHVl QFD I QVI QQRLRYYAFKQRAPR CCKGHDDIPHEWGKALMEKWEAFLSVKP 
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Figure 2 (31 
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GVHIYEFASEFELLLGFFIFLRIE: 
TPVVLEFDSEFEMLLAFMTLVKQYGPEFV^ 



EPTVLEFDSEFELL IAFMTLVKQ YAPEFA POYNH 
RPTVLEFDSEYELLVAFLTFLKQYSPEFA 



DVKIYTFPSEYEMLRGFFIFLSWYSPEFI 



QL FKA — QILMG- 



LDGYGRM 
KLNS I YSLKLDGYG S I 
S VYNI KLDGYGKF 
RARHIYSINPASLGKI 
IYNIKLSDYSRV 

L 
CKL 

CTI-KERIDSYRKM 




QD I LKANYLKLLpG IGSVSAQ&KS3M \KMi 



hHV6 
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hVZV 
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K NGKIGIS-VPHEQYRKGFLQA QTKVFT SG VL YLDMY P VYS SK I 

N--GRGVFRVWDIGQSHFQK RSKIKVNGMVNIDMYGIITDKI 

N--GRGVFRVWDIGQSHFQK RSKIKWGMVNIDMYGIITDKV 

N--RGGLFKIWDVGKSGFQR RSKVKINGLISLDMYAIATEKL 

N--KGGLFKVWDIATNHFQK KSKVKINGLI SLDMYSVATEKL 

RA GGVCEVR-RPHDA-GKGFLRA NTKVRITGLIPIDMYAVCRDKL 

KT--GSIFQVH-TPKDT-GNGFMRS VSKIKISGI IAIDMYIVCKDKL 

RR GGRMFVF-SPEKG KAGFGTSNTVKVFWSGTVVLDMYPVCTAKA 

RF GGRFS I Y-VPVGT KPRNASSASIKVHCTGTWLDMYPVCVAKT 

PTAQGGRFFLH-SPAVGFKRQYAAAFPSASHNNPASTAATKVYIAGSWIDMYPVCMAKT 
KDTVQN-FKSHGFGCDIIDMMYV CKRKEFEAKDGSLNTVAQLIIKKFKPHKATP 



hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

sHVl 

mCMV 

gpCMV 

hCMV 

iHVl 



EXO 3 

TAQNYKLDT IAK I CLQQEKEQLS YKE I PKKF I SGPSGRKvVGKYCLQDSVLVlVTlLFKQ IN 
KL S S YKLNAVAEAVLKDKKKDL S YRD I PAYYATGPAQR SVIGBYCIQDSLLV 3QLFFKFL 
KLS S YKLNAVAEAVLKDKKKDLSYRBI PAYYASGPAQR SVIGEYCVQD5LLV 3QLFFKFL 



KLPSYKLDAWGDVLGEHKIDLPYKE I PS YYAGGPDRR 3VI GEYCIQDSRLV SKLFFKYL 
SLSDYKLDTVARHLLGAKKEDVHYKEIPRLFAAGPEGR RRLGMYCVQDSALV VIDLLNHFV 
SLSNYKLDTVANHCIGAKKEDVSYKDIMPLFMSGPEGR ^IGLYCVIBSVLV VIKLLKFFM 
S S PNYKLDTMAE I YLKKKKDDL S YKE I PVQFSAGDEGR I^GG-KYCLQIiAVIxV RELFEMLA 



KI 



•HKMDD I T YDKLDG YYRAGGTKlfaaSCLTYNL IDSUVV IRIAKNLK 



IN 



hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 
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gpCMV 

hCMV 

iHVl 



YHFEVAEVARLAHVTARC WFEGQQKK I FP C I LTEAKRRNMI LP SMVS 

PHLELSAVARLAGINITRTIYDGQQIRVFTCLLRLADQKGFILPDTQGRFRGAGGE 

PHLELSAVARLAG INITRT I YDGQQ I RVFTCLLRLAGQKGFILPDTQGRFRGLDKE 

PHLELSAVARLARITLTKAIYDGQQVRIYTCLLGLASSRGFILPD 

PHLELSAVAKIARITLTRVIFIX5QQIRVYTCLLKLARERNFILPDNRRRFDSQADA 

IHVEVAE IAK IAHI PCRRVLDDGQQ IRVFS CLLAAAQKENFILPM 

IHVE I SEIAKLAKI PTRRVLTDGQQ I RVFSCLLAAARAENYILPV 

FHFEAAAIARLARIPLRKVIFDGQQIRIYTCLLEECSGRDMILPNMPSLG 

YHYEAAS VARLARI PLRS VI FEGQQ I R I YS C ILEEAGERNMILP S FLTAK 

FHYEAGAIARLAKI PLRRVI FDGQQ IRI YTSLLDECACRDFILPNHYSKGTTVPETNS VA 

PMEEYIY RQLAC YN I DTAAHT RGVMNFCGFIQSTKWE 
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Figure 2 (4) 
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hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

sHVl 

mCMV 

gpCMV 

hCMV 

iHVl 



hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

sHVl 

mCMV 

gpCMV 

hCMV 

iHVl 



hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

sHVl 

mCMV 

gpCMV 

hCMV 

iHVl 



-APKRPAAAREDEERP EEEG-EDEDEREE 

-APKRPAVPRGEGERPGDGNGDEDK-DDDEDGDE 

GGYPATFEYKDVI PDVGDVEEEMD 

•ASETSELAMDSQSHAFDSTDEPDGVDGTPDAAG 



HGAAAAIEEAAAGG EGD 

RPGELATESSPVASFEEDSE 

VSPNAAIISTAAVPGDAGSV7^AMFQMSPPLQSAPSSQDGVSPGSGSNSSSSVGVFSVGSG 
VSRNKARLDAGIVMATDYIRNSLF 

REGION 2 

(DFASA» 



SHNRQG I G YKGATVLEPKTG- YYAVPT WF DFQSLYPS IMMAH 

GGG ERE PEGARET AGRH - - VG YQGAKVLDPT SG -FffVNP WW- — DFA5LYPS 1 1 QAH 

DGDERE -EVARETGGRH - - VG YQGARVLDPTSG-FHVDPVVVF DFASLYPS 1 1 QAH 

EDESVSPTGTSSG — RNT — VG YKGARVFDPDTG-FYIDPVVVL -DFASLYP S 1 1 QAH 

SGATSENGGGKPGVGRA — VG YQGAKVLDFVSG-FHVDPVVVF -—DFASLYPS 1 1 QAH 

PSASDRDG YQGATVIQPLSG-FYNSPVLW -DFASLYPS 1 1 QAH 

SNDVNADG FQGATVINP IPG-FYNNAVLVV ^DFASJLYPSIIQAH 

ETSE GENSNNSRT VG YQGATVLEPECG-FHHVPVCVF — - -DFASLYP S I IMSN 

QT SDS SLGEVS S QGS SDGGVG YQGAXVLEFDVG-FYDTFVAVF -DFASLYPS I IMRH 

S S GG VGVSNDNHGAGGTAAVS YQGATV^PEVG-YYNBPVAVF- - --DFA5LYPS I IMAH 
XPET 1 1 RRGGFVMAPLTGLFFARPTQCFELCLDFTSMYPSMMCDL 



2 cont. 



NLCYS r LVLDE — RQ I AG • 



REGION 4 

LSES-DILTVKLGDETH-RFVKPCIREBVLGSL- 

NLCFSfrLSLRAD — AVAH LEAG - KD YLE I EVGGRRL FFVKAHVRE 3LLS IL - 

NLCFS TLSLRPE — AVAH LEAD -RDYLE I EVGGRRL FFVKAHVRE 3LLSIL- 

NLCFT TLTLNFE — TVKR LNP S — DYATFTVGGKRLFFVRSNVRE 3LLGVL- 

NLCFT TLALDEV — DLAG LQPS-VDYSTFEVGDQKLFFVHAHIRE 5LLGIL- 

NLCYS rMI T PGEEHRLAG LRPG-EDYESFRLTGGVYHFVKKHVHE 5FLASL- 

NLCYS ILI PHHALHNYPH LKSS — DYETFMLSSGPIHFVKKHIQA 5LLSRL- 

NLCYS ILLVEG — SPE VPEK-DVLRVEIGDQCH-RFVRENVHR 3LLAEL- 

NLCYS TYLPLG — RDD-G LSDD-DVFLLEFDDGTRYGFVREHVRK SILGEL- 

NLCYSTLLVPG — GEY-P VDPA-DVYSVTLENGVTHRFVRASVRV3VLSEL- 

NlSPg r I VPS DKTNRVGDYMGYDWSK I DQGFEKFTLVLRVDRTDFENPKLVRH FSDTSLS 

REGION 3 

4 cont. (VYGA>) 



LRDWIAMRKQIR 
LRDWLAMRKQIR 



3RIPQ-S 

5RIPQ-S P P EEAVLlJDXQQAAI KVVCNS VYGFTGVQHGLLPCLHVAAT 



LKDttLAMRKAIR \RIPG-SSSDEAVLI DKQQAAIKYVC^SVYGFTGVAQGFLPCLYVAAT 
LRDWLAMRKAVR \RI PT - STPEEAVLI DKQQ S AIKVI CNS VYGFTGVANGLLPCLRIAAT 
LTSWLAKRKAIK KLLAACEDPRQRT IL DKQQLAIKCTC^AVYGFTGVANGLFPCLSIAET 



L^WLTQRKLTO;ilAMKQCTNEMQRMI^ DKQQIALKVTC^FYGF^GVAAG^^LPCLPIAAS 
LARWLAKRKSVR WLAE C QDEVEKL I L DKYQLALKVTCNAFYGFTGVS SGMMPCLP I AAA 
LNKWVSQRRAVR^C^ECQDPTO 

LKRYLRLRTEHK RALKQ S SGSVAE YH- NRLQNEMKI CTNTHYGVSEHTCSLM 
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Figure 2 (S\ 

REGION 3 cont. 



hHV6 
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hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

sHVl 

mCMV 

gpCMV 

hCMV 
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hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

sHVl 

mCMV 

gpCMV 

hCMV 

iHVl 



GLTSSDFT 

PE-AADMRAPG — 
PE-AAGKRAPG — 
PDIESSVLSQK — 
PG-AIGFRNHK-- 



VTCLGREMLCSTVDYVNSKMQSEQFFCEEF 

VTT IGREML LATREYVHARWAAFEQLLADF 

VTT IGREML LAT RAYVHARWAE FDQLLADF 

VTTIGRQMLLSTRDYIHNNWAAFERFITAF 

VTT IGRDML LKTRDYVHSRWATRELLEDNF 

VTLQGRTKL ERAKAFVEA-LSPANLQAIAPSPDAWAPLNPEG 

VTLQGRTMLEKSKIFIEA-MTPDTLQEIVPHI VKHEPDA 

I TK IGRDML LATAGH I EDRCNRPDFLRTVL GLPPEAID 

ITRIGRBML ^SWDYVNTYMGHAEFWLRYL g — EEDLT 

ITRIGRI31LERTARFIKDNFSEPCFLHNFFNQEDYVVGTREGDSEESSALPEGLETSSGG 
ITTQGQHKI[ KLVNE F I KTLNRT GHSLFPN 

REGION 1 
«GDTD1B) 



GD LEV EVT YGDTD5 1 FMSVRN VIVNQ S LRR I APMI AKH I TDRL FKS P I KLEFEK I LC P 

PYSly RIIYGDTDSIFVLCRGLTAAGLTAMGDKMASHISRALFLPPIKLECEKTFTK 

P YS>! RI I YGDTDS IFVLCRG LT 3EALVAMGDKMASH I S RALFL P P I KLECEKT FTK 

AYEV KVI YGDTDS VFIRFKG VS VEG I AK I GEKMAH 1 1 STALFC P P I KLECEKT F I K 

P YSV RVI YGDTDSVFIKFVG LT YEGVSELGDAMSRQ I SADL FRAP I KLECEKT FQR 

QLRVIYGDTDSLFIECRGFSESETLRFADALAAHTTRSLFVAPISLEAEKTFSC 

KF RVI YGDTDSLFVECVG FS VDT WKFGDFLAAFT SEKL FNAP I KLES EKT FQ C 

PEALRV — KI I YGDTDS VFAAFYG I DKEALLKAVGALAANVTNALFKEPVRLEFEKMFVS 
GDALNV — KVI YGDTDS VFV I CGG ^KCGS VLEHGEAIAGHITRALFREP I KLEFEKVFVN 
SNERRVEA RVI YGDTDSVFVRFRG LTPQALVARGPSLAHYVTACLFVEPVKLEFEKVFVS 



YGDTDSTMLYHPS DESETQLEDMVTLEDEMRAEL* 



REGION 7 



REGION 5 



L I L I C KKRYI GR- Q DI SLL I FKGVDLVRKTSdDFVKGWKD I VDLL FFDEEVQTAAVEFS 
LLLL? JCKKYIGVIY GG KML-IKGVDLVRKNNC AFINRTSRALVDLLFYDDTVSGAAAALA 
LLLI7 JCKKYIGVIC GG KML-IKGVDLVRKNNC AFINRTSRALVDLLFYDDTVSGAAAALA 
LLLI1 KKKYIGVIY GG KVL -MKGVDL VRKNNC QFINDYARKLVELLLYDDTVSRAAAEAS 
LLL 1 1 KKKYI GVIN GG K^-MKGVDLVRKNNC SFINLYARHLVDLLLYDEDVATAAAEVT 
LMLIT KKRYVGVLT DG KTL-MKGVELVRKTAC KFVQTRCRRVLDLVLADARVKEAASLLS 
LLLL? JCKRYIGILS STL KLL-MKGVDLVRKTAC KFVQNTSSKILNLILKDPEVKAAAQLLS 
LMMI C KKRYI GKVH G£ QNLSMKGVDLVRRTAC G FVKAWS D VLHMVFNDET VS EGTMKL S 
LMMI C KKRYVGRIY 3£ TKLSMKG IELVRKTAC EYVKSTVRNVLNMI FFEDDVSAGAVELS 
LMMI C :KKRYIGKVE SGLSMKGVDLVRKTAC EFVKGVTRDVLSLLFEDREVSEAAVRLS 
-RE'SEHLKKL S* E-LVNKVKEKTKRTD- T FVQ S FL S DV-ET VL FDDMVEK LR 



hHV6 

hHSVl 

hHSV2 

hVZV 

eHVl 

hEBV 

SHVl 

mCMV 

gpCMV 

hCMV 

iHVl 



HMTQTQLREQGVPVGIHKILRRLCEAREELFQNRADVRHl^SSV^ 

ERPAEEWLARPLPEGLQAFGAVLVDAHRRITDPERDIQDFVLTAELSRHPRAYTNKRLAH 
ERPAEEWLARPLPEGLQT^GAVLVDAHRRITDPERDIQDFVLTAELSRHPRAYTNKRIAH 
CVS IAEWNRRAMPSG^GFGRIIADAHRQITSPKLDINKFVMTAELSRPPSAYINRRI^ 
DVPPAEWVGRPLPSGFDKFGRVLVEAYNRITAPNLDVREFVMTAELSRSPESYTNKRLPH 
HRPFQESFTQGLPVGFLPVIDILNQAYTDLREGRVPMGELCFSTELSRKLSAYKSTQMPH 
TKDPDYAFREGLPDGFLKVIDILNESHKNLRTGQVPVEELTFSTELSRPISSYKTENLPH 
RMT FDDLKKNG I P CE FGP WS RLCRARDDLHLKKVP VPELTLS S VL S QEL S C YKQKNLPH 
RMTMDDVKRHGVP S GFYR I VEAL SNARDEL YLNRVDVKKLVL SAS L S QEVSAYKQQNLPH 
RLSLDEVKKYGVPRGFWRILRRLVQARDDLYLHRVRVEDLVLSSVLSKDISLYRQSNLPH 
LFSQGEVIEPFKDGGTWWWDPLTGIVJMD CSTPFSSELICK 
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Figure 2 (6) 



hHV 6 L S V I RRLAQRKEE I PNVGDR IMYVL I AP SI 

hHSV2 LTVYYKLMARRAQVPSIKDRIPYVIVAQTREVEETVARI^^ 

hHS\ _ LTVYYKLMARRAQVPS IKDRI PYVIVAQTREVEETVARLAALRELDAAAP^ 

hVZV LTVYYKLVMRQGQ I PNVRER I PYVI VAPTDEVEADAKS VALLRG DPLQ 

eHVl LTVYFKLAMRNEELPSVKERIPYVIVAQTEAAEREAGVVNSKRG TAQN 

hEBV LAVYQKFVERNEELPQIHDRIQYVFVEP 

sHVl LTVTKKI ITRHEEPPQVHDRIPYVFV 

mCMV LAVI RRLAARKEEL PAVGDRVEYVLT LP D- GCKKN 

gpCMV LRVI QRLAARRPELP S VGDRVPYVL I AP PPGSSKN 

hCMV IAVIKRLAARSEELPSVGDRVFYVLTAPGVRTAPQGSSDNGDSVTAGWSRSDAIDGTDD 

iHVl L 



hHV6 

hHSVl 
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GNKQTH NYELAEDPNYVIEHKIPIHAEKYFDQIIKA 

PSPAKRPRETPSHADPPGG — ASKPRKLLVS-ELAEDPAYAIAHGVALNTDYYFSHLLGA 
PSPAKRPRETPSHADPPGG — ASKPRKLLVS-EIAEDPGYAIARGVPLNTDYYFSHLLGA 

NTAGKRC -GEAK RKLI I S-DLAEDPIHVTSHGLSLNIDYYFSHLIGT 

PWTKTARPQPK RKLLVS -DLAEDPT YVSENDVPLNTDYYFSHLLGT 

KGGVKGARKT EMAE D P AYAERHG VP VAVDHY FDKLLQ G 

GKTTSCIS NMAEDPTYTVQNN I P I AVDL YFDKL I HG 

VPNYE I AEDPRHWEAKL S INAE KYYE QWKA 

VPNYEISEDPGYVIEHKLPVNGEKYFEHWKT 

DADGGGVEESNRRGGEPAKKRARKPPSAVCNYEVAEDPSYVREHGVPIHADKYFEQVLKA 
EYENAS S I GCHVAKKMVS IGST YL 



hHV6 VTNAISPIFPKTDI-KKEKLLLYLLPMKVYLDET F 

hHSVl AC VT FKAL FGN-NAK I TE S LLKRF I PEV- WHPPDDVAARLRAAGF GAV 

hHSV2 ACVTFK^iLFGN-NAKITESLLKRFIPET -WHPPDDVAARLRAAGF GPA 

hVZV ASVTFKALFGN-DTKLTERLLKRFIPET-RWNVKM^^ 

eHVl ISVTFKALFGN-DVRTTENLLKRFIPETPHKTPTKTQALLERAGF EKLTP- 

hEBV AANILQCLFDN-NSGAALSVLQNFTARPPF 

sHVl VANI IQCLF-K-DSSKTVSVLYNFVSTPVLFSYE LL 

mCMV VTNTLMP\rPPR-DMPKREKFFSLWPQRIYIPDQ FL — HLCGNVNELARG 

gpCMV VTNVLGP 1 1 PK-DCARKEKFLSYVLPQRVYVSRP FM — PYACAANELWG 

hCMV VTNVLSPVFPGGETARKDKFLHMVLPRRLHLEPA FL — PYSVKAHE 

iHVl FFKKIS LYHVRVWR MCADTDGSPSHLYF? 



hHV 6 SAIAEVM — 

hHSVl GAGATAEETRRMLHRAFDTL A 

hHSV2 GAGATAEETRRMLHRAFDTL A 

hVZV AE I TEEEQ SHQ IMRRVFC I PKAI LHQ S 

eHVl — FTPEEESRRILHTVFCTLEAAPHQS 

hEBV 

sHVl TDHSVKA 

mCMV GDDSDGGDSEKENMDTERSSSHEAMET 

gpCMV V 

hCMV CC 

iHVl VSLSRTRAKQRGDH 
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hHV6 GTGGTGTTTGATTTTCAMGTTTGTATCCGAGCA^ 

hCMV GCCGTGTTCGACTTTGCCAGCCTCTACCCTTCCATCATCAT6GCCCACMCCTCTGCTACTCCACCCTGCTGGTGCCG 
gpCMV GCCGT(TTCGATTTCGCCAGTCTGTATCCGTCTATCATTATGCGACACMCCTGTGTTACTCGACGTATCTTCCGCTC 
mCMV TGCGTGTTCGATTTCGCCAGTCTGTATCCGTCCATCATCATGTCCAACAATCTGTGCTACTCCACCCTCTTGGTGGAG 

hHSVl GTGGTGTTCGACTITGCCAGCCTGTACCCCAGCATCATCCAGGCCCACAACCTGTGC'n'CAGCACGCTCTCCCTGAGG 
hHSV2 GTGGTGTTTGACTTTGCCAGCCTGTACCCCAGCATCATCCAGGCCCACMCCTGTGCTTCAGTACGCTCTCCCTGCGG 
hVZV GTCGTAnGGATmGCMGmATATCCMGTATMTOAGGCCCATMCTOTGTmACCACGCTMCGTTAAAT 
eHVl GTGGTGTTTGACTTCGCTAGCTTATACCCMGCATTATCCAGGCCCATMCCTCTGTTTCACCACCCTG^^ 

hEBV CTGGTGGTGGACTnGCCAGCCTCTACCCGAGCATCAnCAGGCTCATMTCTCTGnATTCTACCATGATAACGCCG 
sHVl TTAGTAGTAGACTTTGCTAGCCTGTATCCTAGTATTATACMGCTCATAATCTATGCTACTCCACTCTTATACCCCAC 

iHVl CTGTGTCTGGACTTTACCAGCATGTACCCCAGTATGATGTGCGATCTCAACATCTCTCCTGAAACCATCGTGGACAGC 



5 ' -gtgttcgacttygcnagyytntaycc-3 ' 
DFASA 256 -fold 26mer> 

5 ' -gtgttcgacttycaragyytntaycc-3 ' 
DFQSA 128 -fold 26mer> 
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hHV6 GCTCTGAAAACAACATGTMCTCGGTGTACGGTGTCACGGGAGCGGCGCACGGG 

hCMV GCGCTCAAAGTAACGTGCAACGCTTTCTACGGTrTTACCGGCGTGGTCAACGGT 

gpCMV GCCCTCAAAGTGACGTGCAACGCGTTTTACGGTTTCACCGGGGTCAGCAGCGGC 

mCMV GCCCTCAMGTMCGTGCMCGCTTTCTACGGTTTCACGGGGGTAGCGGCCGGG 

hHSVl GCCATCAAGGTCGTGTGTAACTCGGTGTACGGGTTCACGGGAGTGCAGCACGGA 

hVZV GCGATAAMGTAGTTTGTMTTCCGTGTACGGTTTTACTGGAGTTGCGCAGGGA 

eHVl GCGATTAAGGTGATATGCMCTCGGTTTACGGATTCACGGGGGTGGCAAACGGC 

hEBV GCCATCAAGTGCACGTGCMCGCCGTCTACGGCTTCACCGGGGTGGCCAACGGC 

sHVl GCTATTAAAGTAACTTGTAATGCTGTGTATGGGTTTACAGGAGTTGCGTCAGGC 

i HV1 GAAATGAAGATCTGTACAAACACCCACTACGGGGTCTCTGAGCACACGTGTTCG 



5 " -acgtgcaacgcggtgtayggnktnacngg-3 ' 
VYGA 256 -fold 30mer> 

5 ' -acgtgcaacgcggtgtacggsgtsacsgg-3 ' 
VYGCA 8- fold GC-rich 30mer> 

5' -acgtgcaacgcggtgta>-3" 
VYGSQA 17mer> 
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hHV6 GTAATTTATGGTGATACGGATAGCATCTTTATGTCTGTCAGAAAT 

hCMV ATCATCTACGGGGACACGGACTCCATATTTGTGCTGTGCCGCGGC 

gpCMV GTGATATACGGGGACACGGACAGCGTCTTTGTCATATGCGGCGGT 

mCMV ATCATCTACGGCGACACCGACAGTGTGTTTGCGGCTTTCTACGGC 

hHSVl ATCATCTACGGGGACACGGACTCCATATTTGTGCTGTGCCGCGGC 

hVZV GTTATATATGGAGATACGGATTCTGTGTTTATCCGATTCMGGGT 

eHVl GTTATCTACGGAGACACCGACTCCGTGTTTATCAAGTTTGTGGGC 

SHV1 GTCATATATGGAGACACAGACTCTCTATTTGTAGAATGTGTTGGG 

hEBV GTCATCTACGGGGACACGGACTCGCTGTTTATCGAGTGCCGGGGG 

i HV1 CCCAATTATGGGGATACGGATAGTACGATGCTGTACCACCCATCG 



5 ' -tayggngayacngactccgtgtttgtcgcatgccg-3 ' 



3 ' -atrccnctrtgnctgaggcacaaacagcgtacggc-5 ' 

<GDTD1B 64 -fold 35mer 



3' -aggcacaaacagcgtacggc-5' 
<GDTDSQB 20mer 
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Figure 6 



RFHV CGTCGCTTCCGGCATCCTACCGTGCCT6AACATCGCAGAGACGGTGACCCTCCAGGGCAGGAAAAT 

KSHV . . .T. .C. .T A. .G. .T A A. .G C A. .A. .A. .GC.A. .G. . 

eHV2 . . .G. .C. .G G. .C. .T. .C. .G. .A. .C C. . .T CGC. . 

sHVl A. .T. .G. .A. . .T.G. .G. .A. . .A.A.G. . .T T. .T. .T A. . .C. . .CG. . 

EBV G. .G. .CAA. . . .C. .T.T. .C CTC C G. .G C.C.CG. . 



RFHV GCTGGAAACGTCTCAGGCGTTCGTAGAGGGAATCTCGCCAACGGCACTGGCAGACCTACTGCAGCG 

KSHV G.GA C. .T CC GGAACGC. .A. .G.GT. .C. . . .G.A. 

eHV2 G.AC. .CA. .CGC.A.A GG.GA.C. .CGA. .GG A. .T. .GGCA. 

sHVl . . .A. . . .AA. .AA.AATA. . .A. . . .A.C. . .GA.A. .TGATA. . . .TCA. . .AA.TG.T.CT.A 

EBV .T. . . .GCG.G.CA. . . .C G. . . .CCC.GAGC. .CG.CAAC. . .CAG.C. . .GGCC.CCTC 



RFHV ACCGATCGAGGCGTCTCCG GAAGCCAGGTTTAAAGTGATA 

KSHV G. .A. .A. .CTC. .A. .C .C. . .CA. .C .G. .C. . . 

eHV2 G.G.G.G. . .TGCG.C. .C .T T G. .C. .C 

sHVl TATAG.GA. .CATGAA. .T .T. .G.A. . .CG. . .C. . . 

EBV C. . .GA. .CCTG.G.G. .CCTCAACCCC. .G.G.CA.C. .CG. . .C .C 



"." = identical residues 
"•" = deletions 



-» 



WO 97/04105 

Figure 7 



13/28 



PCTAJS96/11688 



RFHV VASGILPCLNIAETVTLQGRKMLETSQAFVEGISPTALADLLQRPIEASP- • -EARFKVI 

KSHV R A...ER..G..R...DV..---D 

eM2 K F. . .R. . .N.KRYI. .VT.EG. . .I.G.RV.CA. - --D.S. . . . 

sHVl . . . .L. . .IS T. . .K.KI.I.AMT.DT.QEIVPHIVKHE. - - -D.K.R. . 

EBV . .N.LF. . .S T. . .RAK. . . .AL. .AN.QA.APS.DAWA.LNP.GQLR. . 



n it 



identical residues 
"-" = deletions 



Class L 



IAETVTL 



Class II: 



CLNIAET SQAFVE 
VASGILP QGRKMLE 
GILPCLN 



ARFKVI 



Class TIT 



LETSQAF ADLLQRP 
LERSQAF AGLLRRP 

EGISPTA QRPIEAS 
EAISPER RRPIDVS 

IEASP- 
IDVSP- 



EA 
DA 



WO 97/04105 



Figure 8 



14/28 



PCT/US96/11688 



RFHV VASGILPCLNIAETVTLQGRKMLETSQAFVEGISPTALADLLQRP IEASPEARFKVI 

KSHV VASG I LPCLN IAETVTLQGRKMLERSQAFVEAISPERLAGLLRRP VDVSPDARFRVI 

eHV2 VASGILPCUCIAETVTFQGRRMLENSKRYIEGVTPEGLADILGRR VECAPDASFKVI 

SHV1 VASGLLPC I S I AETVTLQGRTMLEKSKI FI EAMTPDTLQE I VPH I VKHEPDAKFRVI 

hEBV VANGLFPCLS IAETVTLQGRTMLERAKAFVEALSPANLQALAPSP DAWAPLNPEGQLRVI 

hCMV VVNGMMPCLPIAASITRIGRDMLERTARFIKDNFSEPCFL^^ 

mCMV VAAGMLPCLPIAASITKIGRDMLLATAGHIEDRCNRPDFLRTVLG LPPEAIDPEALRVKI I 

gpCMV VSSGWPCLPIAMITRIGRDMIJISVVDYVNT^ EEDLTGDALNVKVI 

hHV6 - A AAHGLLPCVAIAASVTCLGREMLCSTVDYVNSKMQSEQFFCEEFG LTSSDFTGDLEVEVI 

hVZV VAQGFLPCLYVAATVTTIGRQMLLSTRDYIHNNWAAFERFITAFP DI ESS VLSQKAYYE VKV I 

hHSVl VQHGLLPCLiiVMTmiGREMLUTREWHARWAAFEQLLADFP EAADMRAPGPYSMRI I 

hHSV2 VQHGLLPCLHVMTVTTIGREMLLATRAYVHARWAEFDQLLADFP EAAGMRAPGPYSMRI I 

eHVl VANGLLPCLRIMTmiGRDMLLKTRDYVHSRWATRELLEDNFP GAIGFRNHKPYSVRVI 

hPOLd AQVGKLPCLEISQSVTGFGRQMIEKTXQLVESKYTV ENGYSTSAKVV 

bPOLd AQVGRLPCLE ISQSVTGFGRQM IEKTKQLVETKYTV ENGYSTSAKVV 
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GVASGILPCLNIAETVTLQGRKM 
RFHV GGCGTCGCTTCCGGCATCCTACCGTGCCTGAACATCGCAGAGACGGTGACCCTCCAGGGCAGGAAAATG 
KSHV GGCGTTGCCTCTGGCATACTGCCTTGCCTAAACATAGCGGAGACCGTGACACTACAAGGGCGAAAGATG 
eHV2 GGCGTGGCCTCGGGCATCCTGCCCTGTCTCAAGATAGCCGAGACGGTCACCTTCCAGGGCAGGCGCATG 
sHVl GGAGTTGCGTCAGGCTrGCTGCCATGCATAAGCATTGCAGAGACTGTTACTCTCCAAGGCCGGACGATG 
EBV GGGGTGGCCAACGGCCTCTTTCCCTGCCTCTCCATCGCCGAGACGGTGACGCTGCAGGGCCGCACGATG 

5' -gtcgcctctggcatcctnccntgyctnaa>-3' 5' -cagggccggaagatg 

PCLNA 128 -fold 29mer> KMLEA 32 -fold 

32mer> 



LETSQAFVEGISPTALADLLQRP 
RFHV CTGGAAACGTCTCAGGCGTTCGTAGAGGGAATCTCGCCAACGGCACTGGCAGACCTACTGCAGCGACCG 
KSHV CTGGAGAGATCTCAGGCCTTTGTAGAGGCCATCTCGCCGGAACGCCTAGCGGGTCTCCTGCGGAGGCCA 
eHV2 CTG6AGAACTCCAAGCGCTACATAGAGGGGGTGACCCCCGAGGGGCTGGCAGACATATTGGGCAGGCGG 
sHVl CTAGAAAMTCAAAMTATTCATAGAAGCAATGACACCTGATACACTTCAAGAAATFGTTCCTCATATA 
EBV nGGAGCGGGCCAAGGCCTTCGTGGAGGCCCTGAGCCCCGCCAACCTGCAGGCCCTGGCCCCCTCCCCG 

ctggaracrtcncargc>-3' 
(KMLEA conf d>) 

5 ' -tctcaggcgttcgtagarggnathtcncc-3 ' 
GISPA 96 -fold 29mer> 



IEASP - - - EARFKVI 

RFHV ATCGAGGCGTCTCCG GAAGCCAGGTTTAAAGTGATA 

KSHV GTAGACGTCTCACCC GACGCCCGATTCAGGGTCATA 

eHV2 GTGGAGTGCGCCCCC GATGCCAGTTTTAAGGTCATC 

SHVl GTGAAGCATGAACCT GATGCGAAGTTCAGAGTCATA 

EBV GACGCCTGGGCGCCCCTCAACCCCGAGGGCCAGCTTCGAGTCATC 
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a.o 



x.3 



x.o 



o.o 



-x.o 



-x.a 




SIMQAHNLCY STLITGSALH GHPELTPDDY ETFHLSGGTV HFVKKHVRES LLSKLLTTWL 



AKRKEIRKNL ASCTDPTMRT ILDKQQLAIK VTCNAVYGFT GVASGILPCL NIAETVTLQG 

A A A A A A A A. AA A A A A A A «•» 

RKMLETSQAF VEGISPTALA DLLQRPIEAS PEARFKVI 



- - antigenic 
- hydrophobic 
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1 SIIQAH -Q.68 



SIIQAHNLCY STLIPGDSLH LHPHLSPDDY ETFVLSGGLV HFVKKHKRES LLTKLLTVWL 

^* —-AAA. . 



AKRKEIRKTL ASCTDPALKT ILDKQQLAIK VTCNAVYGFT GVASGILPCL NIAETVTLQG 



RKMLERSQAF VEAISPERLA GLLRRPIDVS PDARFKVI 



- antigenic 

- hydrophobic 



4 
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Figure 13(A) 



CVNVA> 

gacqaccQcaQCQtQtQcat aaavQtnttvaQnca 

DDRSVCVNVFGQRCYFYTLA 
GACGACCGCAGCGTGTGCGT6AAYGTNTTYGGNCAGCGCTGCTACTTCTACACACTAGCA 60 

PQGVNLTHVLQQALQAGFGR 
CCCCAGGGGGTAAACCTGACCCACGTCCTCCAGCAGGCCCTCCAGGCTGGCTTCGGTCGC 120 

ASCGFSTEPVRKKI LRAYDT 
GCATCCTGCGGCTTCTCCACCGAGCCGGTCAGAAAAAAAATCTTGCGCGCGTACGACACA 180 

QQYAVQKITLSSSPMMRTLS 
CAACAATATGCTGTGCAAAAAATAACCCTGTCATCCAGTCCGATGATGCGAACGCTTAGC 240 

DRLTTCGCEVFESNVDAIRR 
GACCGCCTMCAACCTGTGGGTGCGAGGTGTTTGAGTCCAATGTGGACGCCATTAGGCGC 300 

FVLDHGFSTFGWYECSNPAP 
TTCGTGCTGGACCACGGGTTCTCGACATTCGGGTGGTACGAGTGCAGCAACCCGGCCCCC 360 

RTQARDSWTELEFDCSWEDL 
CGCACCCAGGCCAGAGACTCTTGGACGGAACTGGAGTTTGACTGCAGCTGGGAGGACCTA 420 

KFIPERTEWPPYTILSFDIE 
MGTTTATCCCGGAGAGGACGGAGTGGCCCCCATACACAATCCTATCCTTTGATATAGAA 480 

CMGEKGFPNATQDEDMI IQI 
TGTATGGGCGAGMGGGTTTTCCCAACGCGACTCAAGACGAGGACATGATTATACAAATC 540 

SCVLHTVGNDKPYTRMLLGL 
TCGTGTGTTTTACACACAGTCGGCAACGATAAACCGTACACCCGCATGCTACTGGGCCTG 600 

GTCDPLPGVEVFEFPSEYDM 
GGGACATGCGACCCCCTTCCTGGGGTGGAGGTCTTTGAGTTTCCTTCGGAGTACGACATG 660 

LAAFLSMLRDYNVEFITGYN 
CTGGCCGCCTTCCTCAGCATGCTCCGCGATTACAATGTGGAGTTTATAACGGGGTACAAC 720 

IANFDLPYI IARATQVYDFK 
ATAGCAAACTTTGACCTTCCATACATCATAGCCCGGGCAACTCAGGTGTACGACTTCAAG 780 

LQDFTKIKTGSVFEVHQPRG 
CTGCAGGACTTCACCAAAATAAAAACTGGGTCCGTGTTTGAGGTCCACCAACCCAGAGGC 840 

GSDGGNFMRSQSKVKISGIV 
GGTTCCGATGGGGGCAACTTCATGAGGTCCCAGTCAAAGGTCAAAATATCGGGGATCGTC 900 

PIDMYQVCREKLSLSDYKLD 
CCCATAGACATGTACCAGGTTTGCAGGGAAAAGCTGAGTCTGTCAGACTACAAGCTGGAC 960 
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Figure 13(B) 



TVAKQCLGRQKDDISYKDIP 
ACAGTGGCTAAGCAATGCCTCGGTCGACAAAAAGATGACATCTCATACAAGGACATACCC 1020 

P L F K SGPDGRAKVGNYCVID 
CCGCTTTTTAMTCTGGGCCTGATGGTCGCGCAMGGTGGGAMCTACTGTGTTATTGAC 1080 

SVLVMDLLLRFQTHVEISEI 
TCGGTCCTGGTTATGGATCTTCTGCTACGGTTTCAGACCCATGTTGAGATCTCGGAAATA 1140 

AKLAKIPTRRVLTDGQQIRV 
GCCMGCTGGCCAAGATCCCCACCCGTAGGGTACTGACGGACGGCCAACAGATCAGGGTA 1200 

FSCLLEAAATEGYILPVPKG 
TTnCCTGCCTCnGGAGGCTGCTGCCACGGMGGTTACATTCTCCCCGTCCCAAAAGGA 1260 

DAVSGYQGATVISPSPGFYD 
GACGCGGTOGCGGGTATCAGGGGGCCACTGTMTMGCCCCTCTCCGG^TTCTAT^ 1320 

DPVLVVDFASLYPSI IQAHN 
GACCCCGTACTCGTGGTGGATTTTGCCAGCTTGTACCCCAGTATCATCCAAGCGCACAAC 1380 

LCYSTLIPGDSLHLHPHLSP 
nGTGCTACTCCACACTGATACCCGGC(^nCGCTCCACCTGCACCCACACCTCTCCCCG 1440 

DDYETFVLSGGPVHFVKKHK 
GACGACTACGAMCCTTTGTCCTCAGCGGAGGTCCGGTCCACTTTGTAAAAAAACACAAA 1500 

RESLLAKLLTVWLAKRKEIR 
AGGGAGTCCCTTCTTGCCAAGCTTCTGACGGTATGGCTCGCGAAGAGAAAAGAAATAAGA 1560 

KTLASCTDPALKTI LDKQQL 
AAGACCCTGGCATCATGCACGGACCCCGCACTGAAAACTATTCTAGACAAACAACAACTG 1620 

AIKVTCNAVYGFTGVASGIL 
GCCATCMGGTTACCTGCMCGCCGTTTACGGCTTCACGGGCGTTGCCTCTGGCATACTG 1680 

PCLN IAETVTLQGRKMLERS 
CCTTGCCTAAACATAGCGGAGACCGTGACACTACAAGGGCGAAAGATGCTGGAGAGATCT 1740 

QAFVEAISPERLAGLLRRPV 
CAGGCCTTTGTAGAGGCCATCTCGCCGGAACGCCTAGCGGGTCTCCTGCGGAGGCCAGTA 1800 

DVSPDARFKVIYGDTDSLFI 
GACGTCTCACCCGACGCCCGATTCAAGGTCATATACGGCGACACTGACTCTCTTTTCATA 1860 

CCMGFNMDSVSDFAEELASI 
TGCTGCATGGGTTTCAACATGGACAGCGTGTCAGACTTCGCGGAGGAGCTAGCGTCAATC 1920 

TTNTLFRSPIKLEAEKIFKC 
ACCACCAACACGCTGTTTCGTAGCCCCATCAAGCTGGAGGCTGAAAAGATCTTCAAGTGC 1980 
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Figure 13(C) 



LLLLTKKRYVGVLSDDKVLM 
CTTCTGCTCCTGACTAAAAA6AGATACGTGGGGGTACTCAGTGACGACAAGGTTCTGATG 2040 

KGVDLIRKTACRFVQEKSSQ 
MGGGCGTAGACCTCATTAGGAAMCAGCCTGTCGTTTTGTCCAGGAAAAGAGCAGTCAG 2100 

VLDLILREPSVKAAAKLISG 
GTCCTGGACCTCATACTGCGGGAGCCGAGCGTCAAGGCCGCGGCCAAGCTTATTTCGGGG 2160 

QATDWVYREGLPEGFVKI IQ 
CAGGCGACAGACTGGGTGTACAGGGAAGGGCTCCCAGAGGGGTTCGTCAAGATAATTCAA 2220 

VLNASHRELCERSVPVDKLT 
GTGCTCAACGCGAGCCACCGGGAACTGTGCGAACGCAGCGTACCAGTAGACAAACTGACG 2280 

FTTELSRPLADYKTQNLPHL 
TTTACCACCGAGCTAAGCCGCCCGCTGGCGGACTACAAGACGCAAAACCTCCCGCACCTG 2340 

TVYQKLQARQEELPQIHDRI 
ACCGTGTACCAAAAGCTACAAGCTAGACAGGAGGAGCTTCCACAGATACACGACAGAATC 2400 

PYVFVDAPGSLRSELAEHPE 
CCCTACGTGTTCGTCGACGCCCCAGGTAGCCTGCGCTCCGAGCTGGCAGAGCACCCCGAG 2460 

YVKQHGLRVAVDLYFDK 
TACG1TAAGCAGCACGGACTGCGCGTGGCGGTGGACCTGTATTTCGACAAG 2511 

atraarctrttygacgaggtgcctcatcgatt 
(<YFDKB) 
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Figure 14 



TR1910JCSH KVLMKGVDLIRKTACRFVQEKSSQVLDLILREPSVKAAAKLISGQATDWVYREGLPEGFV 

TR2108EBV laLMKGVELVRKTACKFVQTRCRRVLDLVLADARVKEAASLLSHRPFQESFTQGLPVGFL 

TR2109_eHV RLLMKGVDLVRKTACRFVQETTXAILDLVMGDEAVRAAAERLCAMRVEEVCARGPPVGFL 

TR2110_hHS KMLIKGVDLVRKNNCAFINRTSRALVDLLFYDDTVSGAAAALAERPAEEWLARPLPEGLQ 

TR2111JWZ KVLMKGVDLVRKNNCQFINDYARKLVELLLYDDTVSRAAAEASCVSIAEWNRRAMPSGMA 

TR2112_hHV LLI FKGVDLVRKTSCDFVKGVVKDI VDLLFFDEEVQTMVEFSHMTQTQLREQGVPVGI H 

TR2113_hCM GLSMKGVDLVRKTACEFVKGVTRDVLSLLFEDREVSEAAVRLSRLSLDEVKKYGVPRGFW 

TR2114JBV KTLMKGVELVRKTACKFVQ-RCRRVLDLVLADARVKEAASLLSHRPFQESFTQGLPVGFL 

TR2115_sHV KLLMKGVDLVRKTACKFVQNTSSKI LNLI LKDPEVKAAAQLLSTKDPDYAFREGLPDGFL 

TR1910JCSH KIIQVLNASHRELCERSVPVDKLTFTTELSRPLADYKTQNLPHLTVYQKLQARQEELPQI 

TR2108JBV PVIDILNQAYTDLREGRVPMGELCFSTELSRKLSAYKSTQMPHLAVYQKFVERNEELPQI 

TR2109_eHV KVVDILNDSYRKLRLNRVPVGQLSFSTELSRPISYYKTLTLPHLVVYHKIMQRNEELPQI 

TR2110_hHS AFGAVLVDAHRRITDPERDIQDFVLTAELSRHPRAYTNKRU^HLTVYYKLMAR 

TR2111_hVZ GFGRIIADAHRQITSPKLDINKFVMTAELSRPPSAYINRRLAHLTVYYKLVMRQGQIPNV 

TR2112_hHV KI LRRLCEAREELFQNRADVRHLMLSSVLSKEMAAYKQPNLAHLSVIRRLAQRKEE I PNV 

TR2113_hCM RILRRLVQARDDLYLHRVRVEDLVLSSVLSKDISLYRQSNLPHIAVIKRLAARSEELPSV 

TR2114_EBV PV I DI LNQAYTDLREGRVPMGE LC FSTELSRKLSAYKSTQMPHLAVYQKFVERNEE LPQ I 

TR2115_sHV KVIDILNESHKNLRTGQVPVEELTFSTELSRPISSYKTENLPHLTVYKKIITRHEEPPQV 

• • • • **• * • *• * • * • * • 

TR1910_KSH HDRIPYVFVDAP 

TR2108JBV HDRIQYVFVEPKGG 

TR2109_eHV HDRIAYVFVQSPK 

TOllOJiHS KDRIPYVIVAQTREVEETVARLAALRELDAAAPGDEPAPPAALPSPAKRPRETPSPADPP 

TR2111JiVZ RERIPYVIVAPTDEVEADAKSVALLRGDPLQN TAGKR 

TR2112_hHV GDRIMYVLIAPSIGNKQ 

TR2113_hCM GDRVFYVLTAPGVRTAPQGSSDNGDSVTAGVVSRSDAIDGTDDDADGGGVEESNRRGGEP 

TR2114JBV HDRIQYVFVEPKGG 

TR2115_sHV HDRIPYVFVGKT 

e'A'c A A 

TK1910_KSH GSLRSELAEHPEYVKQHGLRVAVDLYFDK 

TR2108_EBV VKGARKTEMAEDPAYAERHGVPVAVDHYFDKLLQGAAN I LQCLFDNNSGAAL 

TR2109_eHV GKLRSEMAEDPAYAAQHN I PPAVDLYFDKV I HGAAN I LQCLFENDSDKAA 

TR2110_hHS G- - -GASKPRKLLVSELAEDPAYAIAHGVALNTDYYFSHLLGAACVTFKALFGNNAKITE 

TR2111JiVZ - - — CGEAKRKLIISDUEDPIHVTSHGLSLNIDYYFSHLIGTASVTFKALFGNDTKLTE 

TR2U2_hHV THN YELAEDPN YV I EHKI P I HAEKYFDQ 1 1 KAVTNAI SP I FPKTD I K - K 

TR2113_hCM AKKRARKPPSAVCNYEVAEDPSYVREHGVPIHADKYFEQVLKAVTNVLSPVFPGGETARK 

TR2114JEBV VKGARKTEMAEDPAYAERHGVPVAVDHYFDKLLQGAAN I LQCLFDNNSGAAL 

TR2115_sHV TSCISNMAEDPTYTVQNNIPIAVDLYFDKLIHGVANIIQCLFK-DSSKTV 

-t-_i. ^k_t- _■■■>. 
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RFHVMn VASGILPCLNIAETVTLQGRKMLETSQAFVEGI S PTALADLLQRP IEAS P EARFKVI 

RFHVMm KD.S..I....D... D 

KSHV-KS7 .R A. . . ER . . G . . R . .VDV. . D . . . R . . 

KSHV-KSF2 R A . . . ER . . G . . R . . . DV . . D 

KSHVpathl R A. . . ER . . G . . R . .VDV. . D 

KSHVpath2 R A. . . ER . . G . . R . .VDV. . D . . . R . . 

eHV2 K F...R.. .N.KRYI. . VT . EG . . . I . G . RV. CA. D.S 

SHVl . ... L ... IS T. . . K . KI . I . AMT . DT . QE IVPHIVKHE . D.K.R. . 

EBV . .N.LF. . .S T. . .RAK AL. . AN . QA . APS . DAWA . LNP . GQLR . . 
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Figure 16 



( DFASA> ) gtcrt t cqa ctt vacna a vvtnt a vcc 1 
RFMn (QAHNA» ccaaatatcathcaracncavaa CCTCTGTrATTrTAr 

RFMm ...A C 

KS .T.G..C..C..C. 



20 40 60 80 

RFMn CCT6ATTACAGGAAGCGCCCTACACGGGCACCCCGAACTGACCCCCGACGACTACGAAACCTTCCACCTG 

RFMm CAG..G.A....A.T.T.TC GT GA A A 

KS A AC.C. .CGATT.G. .C. . .CT AC.C. .CT. . . .G TGT. . .C 



100 120 140 

RFMn AGCGGGGGMCGGTACACTTTGTAAAAAAGCACGTCCGCGAGTCACTACTGTCCAAACTGCTCACAACAT 

RFMm A. . .C. . . .G C A A. .A T A T. .G. .G. .T. 

KS A. .TC. . . .C A. . .AAAA.G C. .T. .TA. . . .G. .T. .G. .GGT. . 
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